Senior Data Scientist
Role summary
A leading integrated healthcare innovator is seeking a Senior ML/GenAI Engineer to bridge research and production systems. This role involves architecting MLOps pipelines, deploying LLM solutions, and leveraging Python, PyTorch, and LangChain to build clinical copilots and RAG-based knowledge systems. Responsibilities include end-to-end design, optimization of MLOps infrastructure on cloud platforms (Azure/AWS), and developing AI toolsets for healthcare workflows. Experience with Big Data, SQL, and distributed systems is essential, with a preference for healthcare datasets or regulated environments. The position offers a competitive salary, fully remote flexibility, and comprehensive benefits.
We're working with a leading integrated healthcare innovator transforming patient outcomes through large-scale AI on this exciting opportunity.
As a Senior ML/GenAI Engineer, you will bridge the gap between experimental research and production-grade systems, architecting robust MLOps pipelines and deploying cutting-edge LLM solutions. You will be at the forefront of the AI shift in healthcare, leveraging Python, PyTorch, and LangChain to build everything from clinical copilots to advanced RAG-based knowledge systems.
The Role
- Lead the end-to-end design and deployment of scalable GenAI solutions, from initial prototyping and prompt engineering to production monitoring and guardrail implementation.
- Architect and optimize MLOps infrastructure using Kubernetes, CI/CD, and ML platforms (Azure/AWS) to ensure seamless model orchestration and high-availability systems.
- Develop internal and client-facing AI toolsets including search-knowledge assistants, summarization tools, and conversational analytics to automate healthcare workflows.
- Build and refine RAG (Retrieval-Augmented Generation) pipelines, managing embedding strategies and high-performance Vector Databases (e.g., LlamaIndex).
- Collaborating with cross-functional software and data engineering teams to integrate AI models directly into EHR platforms like EPIC.
What You'll Need
- 5+ years of experience building production-grade software/ML systems, with a minimum of 1 year dedicated to LLMs and Generative AI.
- Mastery of Python and deep learning stacks (PyTorch, Transformers) alongside experience with LangChain, orchestration tools (MLFlow, Kubeflow), and Vector DBs.
- Proven track record of delivering production RAG systems, including expertise in prompt engineering, evaluation frameworks (guardrails), and latency optimization.
- Strong background in Big Data and SQL environments (Postgres, Redshift, Snowflake) and distributed systems architecture.
- Specific experience with healthcare datasets (EHR/EPIC) or highly regulated environments is strongly preferred.
What's On Offer
- Competitive compensation range of $91,416 - $152,380 based on impact and experience.
- Fully remote flexibility across 29 eligible states with a focus on work-life harmony.
- Comprehensive benefits including $10k Student Debt Pay Down, $10k Adoption/Fertility reimbursement, and $5,250 annual tuition assistance.
- Strong career growth paths with funding for professional certifications and continuing education.
Apply via Haystack today!
Similar roles
Senior Data ScientistAviva Canada · Toronto, Ontario, Canada · Hybrid- Sr. Data ScientistBurtch Works · Reading, Pennsylvania, United States · Onsite
Data ScientistMANTECH · Ashburn, Virginia, United States · Hybrid
Junior Data ScientistApplied Research Associates, Inc · Fort Belvoir, Virginia, United States · Onsite- Data ScientistBrooksource · Houston, Texas, United States · Onsite