Manager Site Reliability Engineering
Compensation estimateAI
See base, equity, bonus, and total comp estimates for this role — free, no credit card.
Sign up to see compensation estimate- About Our Client:
The organization operates in the managed security services industry, addressing the growing challenge of cybersecurity threats that require continuous, around-the-clock protection. It delivers cloud-based security operations platforms that offer rapid and comprehensive detection and automated response to cyber threats. By providing tailored expert guidance, the organization helps clients reduce risk and improve their security posture. Serving hundreds of customers ranging from Fortune 100 companies to mid-sized enterprises, the program focuses on solving complex cyber challenges through a dedicated and values-driven team. The organization has received multiple industry recognitions and significant investment backing, reflecting its scale and impact in cybersecurity.
- About the Opportunity:
The Manager, Site Reliability Engineering leads the design, automation, and reliability of secure, scalable cloud infrastructure and developer platforms within a cybersecurity environment. This role drives operational resilience and high availability while managing and mentoring a high-performing SRE team. The position balances hands-on engineering with leadership responsibilities to enhance platform stability, security, and developer efficiency, directly contributing to the organization''s ability to deliver reliable cybersecurity services.
- Responsibilities:
• Lead and grow the SRE team, setting direction and mentoring engineers
• Design and manage cloud and containerized infrastructure using Infrastructure as Code tools like Terraform
• Implement secure and compliant CI/CD pipelines
• Build and maintain observability systems, defining SLIs, SLOs, and dashboards
• Manage incident response, root cause analysis, and postmortems, automating recovery processes
• Oversee capacity planning, performance tuning, and cost optimization
• Collaborate with InfoSec, DevSecOps, and Compliance teams to ensure alignment with frameworks such as FedRAMP, NIST, and RMF
• Support program-level initiatives with clear communication to stakeholders
• Foster a culture of reliability, security, and developer efficiency
• Dedicate approximately 75% of time to engineering tasks and 25% to leadership and management
- Requirements:
• 8+ years of experience in SRE, DevOps, or Platform Engineering with technical leadership skills suited for a player/coach role
• Proven expertise with cloud platforms (AWS, GCP) and container orchestration (Kubernetes, Docker)
• Strong coding and scripting skills in Python or GO, and proficiency with IaC and GitOps
• In-depth knowledge of observability tools and reliability metrics
• Experience in incident management using tools such as PagerDuty and Datadog
• Track record of mentoring and developing junior and mid-level SRE engineers through hands-on coaching
• Familiarity with cybersecurity and regulatory frameworks including FedRAMP, NIST, STIGs, and RMF
• Excellent communication and stakeholder management abilities
• Preferred certifications: AWS, CKA, or cybersecurity credentials such as OSCP
- Pay Range and Compensation Package:
• The anticipated salary range for this role is $178,000 to $213,000 plus bonus, stock options, and benefits
• Actual compensation may vary based on geographic location, experience, education, and skill level
- Benefits & Perks:
• Medical, dental, vision, and disability insurance
• Flexible Time Off (FTO), 12 company holidays, sick leave, and 8 weeks of paid parental leave
Equal Opportunity Statement: Our client is an equal opportunity employer. They celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, or national origin.
Note:
RemoteHunter is not the Employer of Record (EOR) for this role. Our purpose in this opportunity is to connect exceptional candidates with leading employers. We help job seekers worldwide discover roles that match their goals and guide them to complete their full application directly through the hiring company’s career page or ATS.
Similar roles
- Director of Site Reliability EngineeringJPMorganChase · Palo Alto, California, United States · Onsite
- Director of Site Reliability EngineeringHarrison Clarke · San Francisco, California, United States · Hybrid
- Software Engineer - Site Reliability EngineeringZoox (Amazon) · Foster City, California, United States · Hybrid
- Site Reliability EngineeringZoox (Amazon) · Foster City, California, United States · Hybrid
- Director of Site Reliability EngineeringSolutionForge Systems · United States · Remote