Formation Bio logo
Formation Bio Verified
Biotechnology, Pharmaceuticals, Artificial Intelligence, Software Development

Data Scientist

New York, New York, United StatesHybridFull TimeEntry-level (exp-based)$154,500–$202,000 /yrPosted 2 months agoVisa sponsorship available

Is this role right for you?

Upload your resume and get a skill-by-skill breakdown — see exactly where you match, where you're close, and what to highlight. Not a mystery percentage.

Get a tailored resume highlighting what this role needs.

Role summary

Formation Bio is seeking a Data Scientist to join their platform prediction team. This role involves translating AI-driven probability of success predictions into measurable portfolio outcomes within the drug development space. You will architect core systems for portfolio construction, risk monitoring, and performance attribution, integrating quantitative finance, healthcare data, and AI/ML. Responsibilities include designing risk frameworks, running backtesting experiments, and building dashboards to communicate performance to stakeholders. The ideal candidate has an MS/PhD in a quantitative field, 1-3 years of experience in a quantitative role, strong Python skills, and a solid understanding of portfolio construction and risk concepts. Experience with backtesting frameworks, healthcare data, or AI/ML pipelines is preferred.

### Who you are
- MS or PhD in a quantitative field (statistics, finance, physics, computational science, engineering, or related)
- 1-3 years in a quantitative research, data science, or analytics role — finance, healthcare, academic research, or consulting all count; substantive internships qualify
- Strong Python programming skills with experience in data-intensive workflows (pandas, numpy, scipy)
- Solid grasp of core portfolio construction and risk concepts: position sizing, rebalancing, Sharpe ratio, drawdown, volatility, benchmark comparison
- Demonstrated ability to work with messy, real-world datasets — comfortable with data wrangling, deduplication, and quality assessment
- Clear communicator who can present quantitative results to both technical peers and business stakeholders
- Experience with backtesting frameworks or portfolio simulation (vectorbt, Backtrader, or custom implementations)
- Exposure to healthcare, pharma, or biotech data (clinical trials, claims data, -omics, real-world evidence)
- Familiarity with alternative data in a research or investment context
- Experience with probability-of-success modeling, drug development decision analysis, or health economics
- Comfort with LLMs or AI/ML pipelines in a production or research setting
- Familiarity with dashboard/visualization tools (Streamlit, Plotly, Dash) and pipeline orchestration (Dagster, Airflow)

### What the job involves
- As a Data Scientist on the platform prediction team, you'll translate our probability of success predictions into measurable portfolio-level outcomes
- You'll architect core systems — order management, execution simulation, portfolio construction, risk monitoring, and performance attribution — that let us rigorously evaluate signals from our AI-driven predictions in public and private equities and our internal portfolio
- This role sits at the intersection of quantitative finance, healthcare data, and AI-driven drug development
- If you're excited about applying portfolio construction and risk management fundamentals to one of the most consequential prediction problems in healthcare, this is the role.No other company — hedge fund or pharma — has a technical data science position translating drug development experience into durable AI-native portfolio strategies
- The skills you develop here — portfolio construction over assets with radically asymmetric risk profiles, clinical trial analytics, AI/ML in production, and risk management across multi-year horizons — can directly impact the delivery of new and effective therapeutics to patients by best aligning impactful medicines with economic incentives
- Work with the team to implement and maintain core portfolio engine: order management system, execution simulation layer, portfolio construction service, and performance tracking
- Design risk frameworks that quantify exposure across a portfolio of drug development bets with radically different risk profiles, timelines, and failure modes
- Run rigorous backtesting experiments with strict temporal constraints to evaluate Formation strategies against baseline approaches and measure marginal signal from new evidence sources
- Coordinate across the organization to integrate internal Formation data sources (clinical trial data, genomic evidence, real-world data) and proprietary tooling into portfolio analytics pipelines
- Work with product and engineering teams to build dashboards and reporting that communicate portfolio performance, risk metrics, and strategy comparisons to both technical and executive stakeholders
- Collaborate with the broader data science team to ensure portfolio-level evaluation feeds back into model improvement and evidence prioritization

### Benefits
- Flexible Time Off: We have a flexible PTO policy as we trust our team members to take the time they need to recharge.
- Parental Leave: We believe your personal life enriches your professional one. We offer 16 weeks of paid parental leave for birthing parents and 12 weeks for non-birthing parents so you can start or grow your family.
- Comprehensive Benefits: All full time employees have access to health, vision, and dental insurance. Additionally, we provide pre-tax commuter benefits, access to a 401k plan, short and long-term disability and life insurance.
- Education & Career Development Resources: Formation Bio offers a plethora of professional development resources to employees. Some examples include fireside chats with industry experts, conference passes, certification courses, internal mobility opportunities, in depth manager training, leadership coaching program, and more.
- Strategic Hiring Hubs: Formation Bio teams are hybrid and based in four key hubs: New York City, Boston, the San Francisco Bay Area, and North Carolina’s Research Triangle. Our bright and collaborative NYC headquarters is conveniently located near major transit hubs and stocked with drinks, coffee, snacks, and a weekly lunch.
- Community: Our teams collaborate closely to achieve our mission, supporting each other across functions and priorities. We build connection through team-building events, quarterly company gatherings, a company-wide retreat every two years, and Employee Resource Groups.

Ready to apply?
You'll be redirected to Formation Bio's application page.

Similar roles