We're in beta · Starting with US & Canada · Shipping weekly — your feedback shapes RiseMe
Haystack logo
Haystack Verified
Software, Developer Tools, Analytics

Senior AI Engineer

San Diego, California, United StatesRemoteFull TimeSenior$111,300–$166,900 /yrPosted 1 month ago

Compensation estimateAI

See base, equity, bonus, and total comp estimates for this role — free, no credit card.

Sign up to see compensation estimate

Senior AI Platform Engineer | San Diego, California | $111,300 - $166,900

We're working with a global leader in wireless innovation and semiconductor technology on this exciting opportunity.

Shape the future of edge computing and intelligence by building the high-performance infrastructure that powers next-gen Generative AI. We are looking for an expert to architect robust platforms for LLM hosting, agentic workflows, and intensive ML workloads at a massive global scale.

The Role

• Lead the deployment and optimization of Large Language Models (LLMs) using AWS Bedrock, GCP Vertex, and Azure AI Foundry.

• Architect and manage production-grade Kubernetes clusters, focusing on GPU scheduling, autoscaling, and high availability for AI/ML workloads.

• Scale agentic workflow orchestration systems (like n8n) and manage large-scale semantic search via Elasticsearch vector solutions.

• Design comprehensive observability stacks using Prometheus, Grafana, and OpenTelemetry to monitor multi-cloud performance and latency.

• Automate entire environments using Infrastructure as Code (Terraform, Helm) and build robust CI/CD pipelines for AI-driven automation.

What You'll Need

• 5–7 years of deep experience in Platform Engineering, MLOps, or SRE roles with a focus on cloud-native AI systems.

• Advanced proficiency in Kubernetes, including hands-on experience with GPU-accelerated clusters and complex autoscaling.

• Expertise in multi-cloud architecture across AWS (SageMaker), Google Cloud, and Azure to deliver secure, highly-available services.

• Strong Python development skills combined with Linux systems administration and a deep understanding of cloud security/IAM.

• Proven track record hosting/serving LLMs at scale; familiarity with vLLM, Triton, KServe, or Ray Serve is a major plus.

What's On Offer

• Highly competitive salary ($111k - $166k) plus annual discretionary bonuses and RSU grant potential.

• Flexible, remote-friendly culture at the cutting edge of the global AI and semiconductor revolution.

• Comprehensive benefits package designed for work-life harmony, including premium healthcare and wellness support.

Apply via Haystack today!

Ready to apply?
You'll be redirected to Haystack's application page.

Similar roles