Senior DevOps / SRE Engineer
Overview
We are seeking a Senior DevOps / SRE Engineer to own platform reliability, CI/CD pipelines, and cloud infrastructure for a high-scale production environment. This role is critical to ensuring systems are resilient, scalable, and easy to operate, so that engineering teams can deploy confidently and recover quickly when issues arise.
You will work closely with platform, data, and product engineering teams to build and maintain robust, cloud-native infrastructure, with a strong focus on Kubernetes, GitOps, and reliability engineering best practices.
What You’ll Do
- Design, build, and maintain CI/CD pipelines using reusable GitHub Actions workflows
- Own GitOps workflows using ArgoCD, managing application promotion across environments
- Operate and upgrade Kubernetes clusters (EKS), including node groups, autoscaling, and cluster add-ons
- Manage infrastructure as code using Terraform, including PR-driven workflows and state management
- Define and maintain SLOs, alerting strategies, and observability dashboards across platform services
- Operate and maintain secrets management systems (HashiCorp Vault), including policies and authentication
- Implement supply chain security controls including image scanning, signing, SBOM generation, and policy enforcement
- Partner with security teams on network policies, egress controls, and compliance requirements
- Participate in on-call rotations and lead incident response and post-incident reviews
What You Bring
- 6+ years of experience in DevOps, SRE, Platform Engineering, or Production Operations
- Strong experience managing CI/CD pipelines, GitOps workflows, and Kubernetes in production environments
- Experience operating and scaling Kubernetes clusters (EKS preferred)
- Expertise in infrastructure as code (Terraform), including state management and automated deployment workflows
- Proven experience implementing observability and reliability practices (SLOs, alerting, dashboards, incident response)
- Experience with secrets management systems such as HashiCorp Vault
- Strong collaboration skills with the ability to support multiple engineering teams
Technical Expertise
- Kubernetes (cluster operations, autoscaling, RBAC, workload isolation, upgrades)
- GitOps (ArgoCD configuration, sync policies, rollback strategies)
- CI/CD (GitHub Actions, reusable workflows, deployment gates, secrets management)
- Terraform (modular design, state management, Atlantis workflows)
- Observability tools (Prometheus, Grafana, Loki, Tempo, Alertmanager)
- Service mesh (Istio, mTLS, traffic management, authorization policies)
- Autoscaling and provisioning (KEDA, Karpenter)
- Secrets management (HashiCorp Vault)
- Container and supply chain security (Trivy, Cosign, SBOMs, OPA/Gatekeeper)
- Scripting and automation (Python, Bash)
AI / Automation
- Experience leveraging AI tools to accelerate infrastructure development, CI/CD workflows, and operational processes
- Familiarity with AI-assisted incident response, log analysis, and runbook generation
- Ability to integrate AI-driven quality and security checks into delivery pipelines
Key Traits for Success
- Strong ownership mindset over reliability, scalability, and system performance
- Focus on automation and eliminating manual operational work
- Ability to proactively identify and address reliability risks
- Clear and structured communication during incidents and operational events
Why This Role
- High-impact role supporting a complex, production-grade platform
- Opportunity to work with modern cloud-native and GitOps technologies
- Fully remote within the U.S.
- Collaborative environment with strong engineering alignment