
Karumi Verified
Software Development, Mobile Development, IT Consulting
AI/ML Applied Engineer
New York, New York, United StatesOnsiteFull Time$120–$170 /hrPosted 2 months agoHidden Gem · YC Startup
Role summary
Join an AI engineering team to build the core intelligence of a platform, focusing on voice AI, browser automation, and large language models. This role involves designing voice experiences, creating browser agents for web application interaction, and optimizing LLM behavior for production reliability. Responsibilities include building and optimizing speech AI systems, implementing browser automation with computer vision, engineering prompt systems, creating evaluation frameworks, integrating multimodal AI, and managing real-time AI pipelines. The ideal candidate has production experience with LLMs, speech AI, browser automation, strong Python skills, and an understanding of prompt engineering and observability.
## The Opportunity
Join our AI engineering team in the US to build the core intelligence behind our platform. You'll work at the intersection of voice AI, browser automation, and large language models - creating agents that can listen, speak, navigate interfaces, and interact naturally with users in real-time.
This role combines cutting-edge AI with practical systems work. You'll design voice experiences, build browser agents that understand and control web applications, and optimize LLM behavior for production reliability. We ship working AI features that solve real problems, balancing innovation with pragmatic constraints.
_We sponsor visas for qualified candidates._
## Core Responsibilities
\- Build and optimize voice AI systems using speech-to-text and text-to-speech models
\- Design browser agents that navigate, understand, and interact with web applications
\- Implement browser automation with computer vision and DOM understanding
\- Engineer prompt systems and LLM workflows for consistent, intelligent behavior
\- Create evaluation frameworks to measure voice quality, agent accuracy, and user experience
\- Integrate multimodal AI - combining voice, vision, and language understanding
\- Build real-time AI pipelines where latency and reliability are critical
\- Manage the AI Infrastructure and take care of it
\- Monitor and improve AI system performance in production environments
## Technical Requirements
\- Production experience with LLMs (OpenAI, Anthropic, or open-source models)
\- Hands-on work with speech AI (STT/TTS systems like Deepgram, ElevenLabs, Whisper)
\- Experience with browser automation (Playwright, Puppeteer, Selenium) or computer vision
\- Strong Python skills with async programming and real-time systems
\- Understanding of prompt engineering, retrieval systems, and agent frameworks
\- Ability to debug complex AI behaviors and build observability tools
\- Software engineering fundamentals for production AI systems
## Nice to Have
\- Experience building autonomous agents or multi-step AI workflows
\- Knowledge of computer vision for UI understanding and visual grounding
\- Fine-tuning or training language models for specialized tasks
\- Real-time audio processing and streaming architectures
\- Background in NLP, machine learning research, or AI systems
## Why Karumi
\- Meaningful equity stake in a backed, fast-growing company\\
* Work on cutting-edge voice AI and browser agents in production\\
* Shape how AI systems interact with users and software interfaces\\
* Small team with direct impact on core product capabilities\\
* Gym
* Visa sponsorship available
Join our AI engineering team in the US to build the core intelligence behind our platform. You'll work at the intersection of voice AI, browser automation, and large language models - creating agents that can listen, speak, navigate interfaces, and interact naturally with users in real-time.
This role combines cutting-edge AI with practical systems work. You'll design voice experiences, build browser agents that understand and control web applications, and optimize LLM behavior for production reliability. We ship working AI features that solve real problems, balancing innovation with pragmatic constraints.
_We sponsor visas for qualified candidates._
## Core Responsibilities
\- Build and optimize voice AI systems using speech-to-text and text-to-speech models
\- Design browser agents that navigate, understand, and interact with web applications
\- Implement browser automation with computer vision and DOM understanding
\- Engineer prompt systems and LLM workflows for consistent, intelligent behavior
\- Create evaluation frameworks to measure voice quality, agent accuracy, and user experience
\- Integrate multimodal AI - combining voice, vision, and language understanding
\- Build real-time AI pipelines where latency and reliability are critical
\- Manage the AI Infrastructure and take care of it
\- Monitor and improve AI system performance in production environments
## Technical Requirements
\- Production experience with LLMs (OpenAI, Anthropic, or open-source models)
\- Hands-on work with speech AI (STT/TTS systems like Deepgram, ElevenLabs, Whisper)
\- Experience with browser automation (Playwright, Puppeteer, Selenium) or computer vision
\- Strong Python skills with async programming and real-time systems
\- Understanding of prompt engineering, retrieval systems, and agent frameworks
\- Ability to debug complex AI behaviors and build observability tools
\- Software engineering fundamentals for production AI systems
## Nice to Have
\- Experience building autonomous agents or multi-step AI workflows
\- Knowledge of computer vision for UI understanding and visual grounding
\- Fine-tuning or training language models for specialized tasks
\- Real-time audio processing and streaming architectures
\- Background in NLP, machine learning research, or AI systems
## Why Karumi
\- Meaningful equity stake in a backed, fast-growing company\\
* Work on cutting-edge voice AI and browser agents in production\\
* Shape how AI systems interact with users and software interfaces\\
* Small team with direct impact on core product capabilities\\
* Gym
* Visa sponsorship available