Software Development, Information Technology & Services

Senior Data Engineer

United States | Hybrid | Full Time | Senior | $115,000–$195,000 /yr | Posted 2 months ago

Role summary

Absentia Labs is seeking a Senior Data Engineer to architect and lead the design of their biomedical data platform. This role involves owning the evolution of data systems, defining schema-driven models for complex biomedical data (chemical, biological, omics, etc.), and establishing best practices for data quality and governance. The engineer will build and maintain cloud-native infrastructure, design batch and streaming pipelines for ML workloads, and partner with cross-functional teams. The ideal candidate has 5+ years of experience in data/platform engineering, strong Python skills, deep cloud platform expertise (AWS, GCP, Azure), and experience supporting ML/AI workloads. This is a hands-on, high-autonomy role in a research-driven environment.

About Absentia Labs

Absentia Labs is building the data and intelligence infrastructure that powers the next generation of biomedical discovery. We work at the intersection of biology, chemistry, machine learning, and large-scale systems, transforming fragmented scientific data into reliable, machine-learning-ready knowledge.

Biomedical data is dispersed, semi-structured, and inherently noisy, yet deeply interconnected across experiments, assays, compounds, and biological systems. Extracting value from this complexity requires deliberate schema design, principled abstractions, and rigorous post-processing pipelines that can support both scientific reasoning and large-scale AI.

We believe breakthroughs start with strong data foundations. This role sits at the architectural core of our platform, shaping how scientific data is modeled, validated, versioned, and served across the organization.

**The Role**

As a Senior Data Engineer, you will own the design and evolution of Absentia Labs’ biomedical data platform. You will operate with a high degree of autonomy, making long-horizon architectural decisions while remaining hands-on in implementation.

This role is ideal for an engineer who enjoys working in high-ambiguity, research-driven environments, and who understands that data engineering for AI is as much about representation and correctness as it is about scale.

**What You’ll Do**

  • Architect and lead the design of end-to-end data systems for large-scale biomedical datasets (chemical, biological, toxicology, omics, assay, clinical, and experimental data).
  • Define and evolve schema-driven data models that reconcile noisy, semi-structured, and heterogeneous sources into coherent, interoperable representations.
  • Establish best practices for data quality, validation, provenance, lineage, and versioning suitable for scientific and ML workflows.
  • Build and maintain cloud-native data infrastructure (data lakes, warehouses, object storage, streaming systems) with an emphasis on scalability and reliability.
  • Design pipelines that support both batch and streaming access for ML training, evaluation, and inference.
  • Partner closely with ML engineers, scientists, and product leads to translate research needs into durable data abstractions.
  • Make principled trade-offs around performance, cost, flexibility, and correctness in production systems.
  • Provide technical leadership through design reviews, architectural guidance, and mentorship of other engineers.
  • Identify and proactively address systemic risks in data integrity, scalability, and operational complexity.

**Who You Are**

You are a data engineer who thinks in systems and interfaces, not just pipelines. You are comfortable owning poorly defined problems and converging on robust solutions through thoughtful design and iteration.

You understand that biomedical data is rarely “clean,” and that schema design, normalization, and semantics are first-order engineering problems, especially in AI-driven settings.

**You Likely Have**

  • 5+ years of experience in data engineering, platform engineering, or ML infrastructure roles, with clear ownership of production systems.
  • Proven experience designing and operating large-scale, production-grade data pipelines.
  • Strong proficiency in Python and data-centric software engineering practices.
  • Deep experience with cloud platforms (AWS, GCP, or Azure), including storage, compute, and security primitives.
  • Familiarity with distributed data processing and orchestration systems (e.g., Spark, Beam, Ray, Airflow, Dagster).
  • Experience supporting ML/AI workloads, including dataset generation, feature pipelines, and reproducible training workflows.
  • Strong architectural judgment and the ability to communicate technical decisions clearly across disciplines.

**Bonus If You Have**

  • Prior work with biomedical or life-science data (e.g., omics, assays, molecular representations, clinical or toxicology data).
  • Experience with streaming platforms (Kafka, Pub/Sub, Kinesis).
  • Exposure to ontology-aware data modeling or schema evolution in scientific domains.
  • Infrastructure-as-code and systems experience (Terraform, Docker, Kubernetes).
  • Experience in early-stage startups or research-heavy environments.
  • Open-source contributions or technical publications.

**What We Offer**

  • A chance to architect the data backbone of an AI-driven biomedical platform.
  • Direct impact on how scientific data is translated into machine intelligence.
  • High autonomy, high trust, and ownership over critical systems.
  • Flexible remote or hybrid work arrangements.
  • A deeply technical, low-ego culture focused on learning and rigor.
  • Competitive compensation, including meaningful equity participation that lets you share directly in the long-term success and growth of the company.

**About MeeBoss:**

MeeBoss is a startup revolutionizing the online recruiting industry. Our mission is to streamline the hiring process, empowering both job seekers and employers to connect effectively.

By leveraging cutting-edge technology, MeeBoss is transforming the way companies attract, engage, and hire top talent. Our solutions are tailored to the needs of modern businesses: employers chat directly with job-ready talent, and job seekers connect with the person behind the job, for a seamless, personalized experience anywhere, anytime. All postings we share are carried out with the prior consent and partnership of our clients.
