Jobs Ai (Verified)
Human Resources, Artificial Intelligence, Software

Senior Advanced Software Engineer - SRE with Virtualization

Phoenix, Arizona, United States · Onsite · Full Time · Senior · Posted 21 days ago

- Role: Site Reliability Engineer (SRE)
- Location: Phoenix, AZ, USA (Hybrid)
- Employment Type: Full-Time

Role Overview

We are hiring for one of our clients seeking a Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of critical systems and services. In this role, you will work closely with development and operations teams to implement best practices in reliability engineering, automation, and monitoring. You will play a key role in enhancing system stability and operational efficiency within a global 24/7 environment.

Key Responsibilities

  • Define and manage service SLOs/SLIs, track error budgets, and drive reliability improvements
  • Identify system bottlenecks and implement proactive remediation strategies
  • Establish and maintain CI/CD best practices and deployment standards
  • Implement observability solutions (metrics, logs, traces) using tools like Prometheus, Grafana, ELK, and OpenTelemetry
  • Build dashboards, alerts, and runbooks to support efficient incident response
  • Manage incident response processes, including on-call rotations and root cause analysis (RCA)
  • Conduct performance testing, capacity planning, and cost optimization initiatives
  • Automate operational processes and reduce manual workload through tooling
  • Manage Kubernetes clusters and containerized environments
  • Implement Infrastructure as Code (IaC) using tools like Terraform or CloudFormation
  • Apply DevSecOps practices including vulnerability management and IAM security
  • Collaborate with cross-functional teams on system design, deployment, and production readiness
  • Develop documentation, standards, and knowledge-sharing resources

Required Qualifications & Experience

  • Bachelor’s degree in Computer Science, Engineering, or a related technical field
  • 4–8+ years of experience in SRE, DevOps, Platform Engineering, or Operations roles
  • Hands-on experience with cloud platforms (AWS, Azure, or GCP)
  • Strong knowledge of Docker and Kubernetes (AKS/EKS/GKE)
  • Experience with observability tools and incident management practices
  • Proficiency in Infrastructure as Code (Terraform preferred)
  • Programming/scripting skills in Python, Go, or Bash
  • Strong analytical, troubleshooting, and problem-solving abilities

Compensation & Benefits

  • Competitive salary package
  • Medical, Dental, and Vision Insurance
  • Life Insurance and Disability coverage
  • Retirement benefits (e.g., 401(k) or equivalent)
  • Paid Time Off and company holidays
  • Employee Assistance Programs
  • Learning and development opportunities

Inclusion & Diversity

We are committed to creating an inclusive and diverse workplace. All qualified applicants will receive consideration for employment without regard to race, gender, age, disability, or any other protected status.

Apply Now!
