
Senior Data Scientist
Role Summary:
The Senior Data Scientist will be responsible for developing and maintaining automated testing frameworks for data pipelines and machine learning models. This role involves designing and executing test cases, monitoring data quality, and performing exploratory data analysis to uncover issues. The ideal candidate will collaborate with cross-functional teams, leverage real-world and synthetic data for testing, and coordinate human-in-the-loop and A/B tests. Proficiency in Python, SQL, testing frameworks, machine learning libraries, and statistical testing principles is required, along with familiarity with CI/CD and version control.
Key Responsibilities:
- Develop and maintain automated testing frameworks for data pipelines and machine learning models.
- Design and execute test cases to validate statistical models, algorithms, and data transformations.
- Monitor data quality, detect anomalies, and ensure consistency across datasets.
- Collaborate with data scientists, engineers, and QA teams to define test strategies and acceptance criteria.
- Perform exploratory data analysis to uncover hidden issues in data or model behavior.
- Leverage real-world data and build synthetic datasets to simulate edge cases, stress-test models, ensure unbiased predictions, and verify data security.
- Coordinate with end users to run human-in-the-loop and A/B tests.
- Document test results, bugs, and performance metrics to support continuous improvement.
Required Qualifications:
- Bachelor’s or Master’s degree in Computer Science, Data Science, Statistics, or a related field.
- 3+ years of experience in data science and AI/ML testing.
- Proficiency in Python, SQL, and testing frameworks (e.g., PyTest, unittest).
- Experience with machine learning libraries (e.g., scikit-learn, TensorFlow, XGBoost).
- Strong understanding of statistical testing, model validation, and data integrity principles.
- Familiarity with CI/CD pipelines and version control (e.g., Jenkins, Git).
Preferred Skills:
- Experience using Oracle AI Data Platform / Oracle Cloud Infrastructure (OCI), including Medallion architecture.
- Strong mastery of SQL.
- Knowledge of MLOps and model monitoring tools.
- Familiarity with Azure DevOps (ADO) for test management.
- Excellent communication and documentation skills.
Galent is an equal opportunity employer. All employment decisions are based on qualifications, merit, and business needs. The company does not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, age, or any other characteristic protected by applicable law.