Senior Advanced Software Engineer - SRE with Virtualization

Phoenix, Arizona, United StatesOnsiteFull TimeSeniorPosted 21 days ago

Compensation estimateAI

See base, equity, bonus, and total comp estimates for this role — free, no credit card.

- Role: Site Reliability Engineer (SRE)
- Location:
Phoenix, AZ, USA (Hybrid)
- Employment Type:
Full-Time

Role Overview

We are hiring for one of our clients seeking a
Site Reliability Engineer (SRE)
to ensure the reliability, scalability, and performance of critical systems and services. In this role, you will work closely with development and operations teams to implement best practices in reliability engineering, automation, and monitoring. You will play a key role in enhancing system stability and operational efficiency within a global 24/7 environment.

Key Responsibilities

Define and manage service SLOs/SLIs, track error budgets, and drive reliability improvements
Identify system bottlenecks and implement proactive remediation strategies
Establish and maintain CI/CD best practices and deployment standards
Implement observability solutions (metrics, logs, traces) using tools like Prometheus, Grafana, ELK, and OpenTelemetry
Build dashboards, alerts, and runbooks to support efficient incident response
Manage incident response processes, including on-call rotations and root cause analysis (RCA)
Conduct performance testing, capacity planning, and cost optimization initiatives
Automate operational processes and reduce manual workload through tooling
Manage Kubernetes clusters and containerized environments
Implement Infrastructure as Code (IaC) using tools like Terraform or CloudFormation
Apply DevSecOps practices including vulnerability management and IAM security
Collaborate with cross-functional teams on system design, deployment, and production readiness
Develop documentation, standards, and knowledge-sharing resources

Required Qualifications & Experience

Bachelor’s degree in Computer Science, Engineering, or a related technical field
4–8+ years of experience in SRE, DevOps, Platform Engineering, or Operations roles
Hands-on experience with cloud platforms (AWS, Azure, or GCP)
Strong knowledge of Docker and Kubernetes (AKS/EKS/GKE)
Experience with observability tools and incident management practices
Proficiency in Infrastructure as Code (Terraform preferred)
Programming/scripting skills in Python, Go, or Bash
Strong analytical, troubleshooting, and problem-solving abilities

Compensation & Benefits

Competitive salary package
Medical, Dental, and Vision Insurance
Life Insurance and Disability coverage
Retirement benefits (e.g., 401(k) or equivalent)
Paid Time Off and company holidays
Employee Assistance Programs
Learning and development opportunities

Inclusion & Diversity

We are committed to creating an inclusive and diverse workplace. All qualified applicants will receive consideration for employment without regard to race, gender, age, disability, or any other protected status.

Apply Now!

Ready to apply?

You'll be redirected to Jobs Ai's application page.

Compensation estimateAI

Similar roles