Data Scientist
Role summary
Commence is seeking a Data Scientist to leverage advanced analytics and machine learning within the healthcare sector. This role involves analyzing complex datasets (EHRs, claims, FHIR), building predictive models (including Generative AI), and communicating insights through visualizations. The Data Scientist will collaborate with cross-functional teams, ensure regulatory compliance (HIPAA, 42 CFR Part 2), and work with cloud-based environments (AWS, Azure). The ideal candidate will have a Bachelor's degree, a minimum of 4 years of experience, proficiency in Python/R/SQL, and practical experience deploying AI/ML models to production. Familiarity with healthcare data, regulations, and big data tools is essential.
Description
At Commence, we’re the start of a new age of data-centric transformation, elevating health outcomes and powering better, more efficient processes for program and patient health. We combine quality data-driven solutions that fuel answers, technology that advances performance, and clinical expertise that builds trust to create a more efficient path to quality care.
With human-centered, healthcare-relevant, and value-based solutions, we create new possibilities with data. We provide proof beyond the concept and performance beyond the scope with a focus on efficiencies that transform the lives of those we serve. With a culture driven by purpose, straightforward communication and clinical domain expertise, Commence cuts straight to better care.
Requirements
The Data Scientist combines advanced analytics and machine learning with deep domain knowledge to generate clinical and operational insights, build predictive models, and support decision-making across the healthcare ecosystem. The ideal candidate will be adept at working with complex datasets (e.g., EHRs, claims, FHIR), collaborating cross-functionally, and ensuring data use aligns with regulatory and ethical standards.
- Prepare and analyze large and complex datasets to identify trends, patterns, and insights that drive business decisions.
- Develop, implement, and optimize machine learning models, including Generative AI, for predictive analytics, classification, and other applications.
- Apply statistical techniques to analyze data and build models that forecast future trends and behaviors (e.g., risk stratification, disease progression, readmission likelihood).
- Create compelling data visualizations and dashboards to effectively communicate findings and insights to stakeholders.
- Connect and match records within and across internal and external data sources to augment and enhance analyses.
- Work closely with cross-functional teams, including data engineers, software developers, and business analysts, to gather requirements and deliver data solutions.
- Stay current with industry trends and advancements in data science and integrate new techniques and tools into existing workflows. Identify, review, and execute impactful analytical approaches from industry whitepapers.
- Translate complex data into actionable insights to support clinical decision-making, care optimization, and operational efficiency, often through dashboards or reports.
- Collaborate with clinicians, informaticists, and SMEs to derive relevant features from health data, such as comorbidity indices, lab value trajectories, or time-to-treatment measures.
- Ensure data use complies with HIPAA, 42 CFR Part 2, and other applicable regulations. Apply de-identification, data masking, or differential privacy techniques when needed.
- Help define and sometimes implement workflows for data acquisition, preprocessing, and model inference pipelines, often in cloud-based environments (e.g., AWS, Azure).
- Identify and mitigate potential biases in data or models, and ensure outputs are interpretable by clinical or policy stakeholders.
- Monitor model performance over time and retrain or recalibrate as necessary to maintain accuracy and relevance in evolving clinical environments.
Basic Qualifications
- Bachelor’s degree in Data Science, Computer Science, Statistics, Mathematics, or a related field; Master’s or PhD preferred.
- Minimum of 4 years of experience in data science or a related field.
- Proficiency in programming languages such as Python, R, or SQL.
- Demonstrated hands-on experience deploying AI/ML models to production.
- Working knowledge of Generative AI tuning and implementation techniques and toolsets (e.g., AWS Bedrock, LangChain, Anthropic MCP, and LlamaIndex).
- Experience with big data frameworks and toolsets such as Apache Spark, Databricks, and AWS EMR Studio.
- Experience with AI/machine learning libraries and frameworks (e.g., TensorFlow, Scikit-Learn, PyTorch, and Spark ML Flow).
- Experience working with healthcare datasets such as EHRs, medical claims, FHIR, HL7, or patient-reported outcomes.
- Familiarity with healthcare regulations and standards (e.g., HIPAA, 42 CFR Part 2, HEDIS, CMS measures).
- Demonstrated experience working with notebook tools such as Databricks or Jupyter.
- Strong understanding of statistical analysis and modeling techniques.
- Experience with data visualization tools (e.g., Amazon QuickSight, Power BI, Matplotlib).
- Excellent problem-solving skills and attention to detail.
- Strong communication and interpersonal skills, with the ability to work effectively with diverse teams and stakeholders.
Preferred Qualifications
- AI/ML certifications.
- Familiarity with Databricks, Snowflake, or other modern data platforms.
- Understanding of data governance and security frameworks relevant to healthcare (e.g., NIST, HITRUST).
- Prior experience working with government agencies (e.g., CMS, VA, DoD) or payer/provider organizations.
- Knowledge of healthcare delivery systems and policy frameworks.
Commence is an equal employment opportunity employer. All personnel processes are merit-based and applied without discrimination on the basis of race, color, religion, sex, sexual orientation, gender identity, marital status, age, disability, national or ethnic origin, military and veteran status or any other characteristic protected by applicable law.
If you need assistance or an accommodation due to a disability, you may contact us at 757-306-4920 or hr@commence.ai