Senior Advanced Software Engineer - SRE with Database Administration
Compensation estimateAI
See base, equity, bonus, and total comp estimates for this role — free, no credit card.
Sign up to see compensation estimate- Job Title:
Senior Advanced Software Engineer - SRE with Database Administration
- Location :
Phoenix, Arizona, United States
- Employment Type :
Full-time
Job Summary
We are hiring for one of our clients seeking a Site Reliability Engineer to ensure the reliability, availability, and performance of large-scale systems and services. This role involves working closely with development and operations teams to implement best practices in reliability engineering, automation, observability, and infrastructure management while driving continuous improvements across platforms.
Key Responsibilities
- Reliability Engineering
Define and manage SLOs/SLIs and track error budgets
- Identify system bottlenecks and lead remediation efforts
- Establish and standardize CI/CD best practices
- Observability & Monitoring
Implement and scale metrics, logs, and tracing systems
- Build dashboards, alerts, and runbooks for production systems
- Incident Management
Own on-call rotations, incident triage, and resolution
- Conduct post-incident reviews and implement corrective actions
- Automate rollback, validation, and recovery processes
- Performance & Capacity Planning
Conduct load and resilience testing
- Optimize system performance, scalability, and cost efficiency
- Automation & Tooling
Develop automation tools to reduce operational overhead
- Standardize deployment, recovery, and operational procedures
- Platform & Infrastructure
Manage Kubernetes clusters and containerized environments
- Implement Infrastructure as Code (IaC) practices
Qualifications
- Education & Experience:
Bachelor’s degree in a technical field (Engineering, Computer Science, etc.)
- 4–8+ years of experience in SRE, DevOps, or Platform Engineering roles
- Technical Skills:
Strong experience with cloud platforms (AWS, Azure, or GCP)
- Expertise in Docker and Kubernetes (EKS/AKS/GKE)
- Proficiency in Infrastructure as Code (Terraform or equivalent)
- Experience with observability tools (Prometheus, Grafana, ELK, OpenTelemetry)
- Programming skills in Python, Go, or scripting (Bash)
Preferred Qualifications
- Advanced degree in Computer Science or related field
- Strong understanding of SLOs, error budgets, and resilience engineering
- Experience with CI/CD tools (Jenkins, GitHub Actions, GitLab CI, Azure DevOps)
- Knowledge of database systems (PostgreSQL, MySQL, MongoDB, etc.)
- Familiarity with networking fundamentals (DNS, TCP/IP, load balancing)
- Experience with GitOps, service mesh (Istio/Linkerd), and microservices architecture
- Cloud and Kubernetes certifications (AWS, Azure, GCP, CKA, CKAD)
- Strong problem-solving, communication, and collaboration skills
Additional Requirements
- Must comply with U.S. export control regulations (U.S. Person requirement or ability to obtain authorization)
Benefits
- Medical, Dental, Vision, and Life Insurance
- Short-Term & Long-Term Disability
- 401(k) with company match
- Paid Time Off and Holidays
- Parental Leave
- Flexible Spending & Health Savings Accounts
- Employee Assistance Programs
- Educational Assistance
Equal Opportunity Statement
We are an equal opportunity employer committed to fostering an inclusive and diverse workplace. All qualified applicants will receive consideration without regard to race, gender, disability, or other protected characteristics.
Apply Now !