Infrastructure Engineer
Role summary
Maxana is seeking an experienced Infrastructure Engineer for a fast-growing AI company. This role involves building and maintaining the platform layer for large-scale ML training, inference, and deployment. Key responsibilities include managing GPU and compute infrastructure, improving platform reliability and performance, owning production reliability through monitoring and incident response, and developing internal tooling. The ideal candidate has 5+ years of production infrastructure experience, a strong distributed systems background, and cloud-native expertise (AWS, GCP, or Azure, Docker, Kubernetes). Familiarity with ML infrastructure is a plus.
Maxana is seeking an experienced Infrastructure Engineer for a confidential client — a fast-growing AI company. In this role you will build and maintain the platform layer supporting large-scale ML training, inference, and deployment. This is a high-impact role at the intersection of cloud infrastructure and ML systems.
Key Responsibilities
- Build and maintain infrastructure supporting large-scale ML training and inference workloads
- Work with GPU and compute infrastructure, distributed systems, and cloud-native platforms
- Improve reliability, observability, and performance across the platform layer
- Collaborate directly with senior engineers and product teams on architecture decisions
- Own production reliability — monitoring, incident response, and proactive risk reduction
- Develop and maintain internal tooling and automation to support engineering operations
### Requirements
- 5+ years of infrastructure or platform engineering experience in a production environment
- Strong distributed systems background — experience with large-scale compute workloads preferred
- Cloud-native infrastructure experience — AWS, GCP, or Azure; Docker and Kubernetes required
- Familiarity with ML infrastructure a strong plus — training pipelines, inference serving, GPU workloads
- Experience owning production reliability end to end
### Benefits
- Competitive base salary ($130,000-$240,000) + equity
- Medical, dental, and vision
- Flexible paid time off
- Learning and development stipend
- Working at the forefront of AI infrastructure at scale
Similar roles
- Infrastructure EngineerHorizontal Talent · Brooklyn Park, Minnesota, United States · Hybrid
- Infrastructure EngineerMercor · New York, New York, United States · Remote
- Senior Infrastructure EngineerD&M Machine Company · Cabot, Arkansas, United States · Onsite
- Infrastructure EngineerDTEX · Fremont, California, United States · Hybrid
Infrastructure EngineerRowspace · New York, New York, United States · Onsite