Software Engineer
Role summary
Benchmark is seeking a Software Engineer to build AI-native features end-to-end, from design through production, focusing on their LLM infrastructure. The role involves working directly with retrieval, context management, memory, embeddings, and evals to create seamless user experiences. The engineer will architect and ship new product features, improve LLM stack components, and integrate model capabilities into workflows. This is an individual contributor role with significant ownership, requiring 3+ years of experience in production application development and a genuine interest in AI agents. The position is hybrid, requiring 4+ days per week in the NYC office.
About Us
Benchmark is the AI platform for the world's best investment firms. Leading firms use Benchmark to work faster and smarter across their entire deal lifecycle — from sourcing to diligence to portfolio management.
The Role
You'll build AI-native features end-to-end, from design through production. That means working directly with our LLM infrastructure — retrieval, context management, memory, embeddings, evals — and turning it into product experiences that feel effortless to users. We believe AI should be a teammate, not a copilot. We're a small team shipping ambitious products, so you'll have real ownership from day one.
Things You Would Work On
- Architect and ship new product features that help investment professionals move faster, with a focus on removing complexity and enabling collaboration across deal teams
- Build on our LLM stack: run evals, improve retrieval, context and memory management, and integrate model capabilities into user-facing workflows
What We're Looking For
- 3+ years of experience building and shipping production applications
- Genuine interest in agents and keeps up to date with current research and model capabilities
- Self-motivated, high ownership and low ego with the ability to work through ambiguity
- Excited to work in-person in our NYC office 4+ days/wee
Bonus
- Experience working with agents in production
- Previous experience as a founder or at an early-stage startup
Tech Stack
Backend: Python, Flask, Postgres · Frontend: TypeScript, React · Infra: GCP
Why Us
- We're a small, technical team where everyone in every role builds. We work in-person because we're trying to do something hard with a lean team, and the velocity we get from being together matters. If that excites you, we'd love to talk.
Similar roles
Senior Software EngineerNorthside Hospital · Atlanta, Georgia, United States · Onsite- Senior Software EngineerRandstad Digital Americas · North York, Ontario, Canada · Hybrid
Software EngineerConcord Servicing, LLC · Dallas, Texas, United States · Remote
Lead Software EngineerElanco · Lake County, Indiana, United States · Onsite
Software EngineerAMERICAN SYSTEMS · Fredericksburg, Virginia, United States · Onsite