May Mobility logo
May Mobility Verified
Automotive, Autonomous Vehicles, Robotics, Artificial Intelligence

Senior Data Scientist

United StatesHybridFull TimeSenior$182,000–$266,000 /yrPosted 2 months agoVisa sponsorship available

Is this role right for you?

Upload your resume and get a skill-by-skill breakdown — see exactly where you match, where you're close, and what to highlight. Not a mystery percentage.

Get a tailored resume highlighting what this role needs.

Role summary

May Mobility is seeking a Senior Data Scientist to develop automated methods for tagging data collected by their autonomous vehicles. This role involves designing, implementing, and deploying state-of-the-art machine learning models for analyzing multimodal data to generate searchable metadata and facilitate downstream engineering workflows. The Senior Data Scientist will also curate high-quality datasets, research novel techniques for feature extraction and learning, and establish frameworks for model validation and performance monitoring. The ideal candidate has expert proficiency in deep learning, machine learning, Python, deep learning frameworks (TensorFlow/PyTorch), PySpark, and experience with production ML systems and MLOps.

### Who you are
- Expert proficiency in designing and implementing deep learning architectures for multimodal data for offline analysis
- Strong understanding of data labeling best practices, label consistency, and performance metrics specifically relevant to large-scale auto-tagging accuracy and dataset curation
- Expertise in machine learning, with hands-on experience in the design, training, and evaluation of a wide range of algorithms
- Awareness of the latest advancements in the field, with the ability to translate innovative concepts into practical solutions for May
- Excellent problem-solving skills with a meticulous approach to model architecture and optimization
- B.S, M.S. or Ph.D. Degree in Engineering, Data Science, Computer Science, Math, or a related quantitative field
- 5+ years of hands-on experience as a Data Scientist or ML Engineer with a strong focus on algorithm design and machine learning
- Expert-level programming skills in Python with extensive use of modern deep learning frameworks like TensorFlow or PyTorch
- Demonstrated experience in building and deploying production-level machine learning systems from conception to delivery
- Expertise in PySpark/Apache Spark for handling large-scale data processing
- Background in robotics or autonomous systems
- Experience working with multimodal data like visual data (images/video), structured perception and behavior outputs (e.g., agent tracks, vehicle state estimation, motion planner outputs)
- Solid understanding of ML deployment lifecycle, MLOps practices, and cloud computing platforms (e.g., AWS, GCP)
- Don’t meet every single requirement? Studies have shown that women and/or people of color are less likely to apply to a job unless they meet every qualification
- At May Mobility, we’re committed to building a diverse, inclusive, and authentic workforce, so if you’re excited about this role but your previous experience doesn’t align perfectly with every qualification, we encourage you to apply anyway! You may be the perfect candidate for this or another role at May

### What the job involves
- May Mobility is experiencing a period of significant growth as we expand our autonomous shuttle and mobility services nationwide
- We are seeking talented data scientists and machine learning engineers to develop automated methods for tagging data collected by our autonomous vehicles
- This will enable us to generate valuable insights from our data, making it easily searchable for triaging issues, creating test sets, and building datasets for autonomy improvements
- Design, implement, and deploy state-of-the-art machine learning models for analyzing multimodal data to generate searchable metadata and facilitating downstream engineering workflows such as quick issue triaging
- Curate high-quality datasets for evaluation and training to ensure model robustness, performance, and coverage
- Research and implement novel techniques for sequential feature extraction, weak supervision, and self-supervised learning to efficiently handle long-tail events and continuously improve labeling data quality
- Establish and maintain frameworks for model validation and performance monitoring to drive continuous improvement

### Benefits
- Onsite, Hybrid, and Remote Work Options

Ready to apply?
You'll be redirected to May Mobility's application page.

Similar roles