Site Reliability Engineer
Role summary
We are seeking a Senior Site Reliability Engineer (SRE) with a robust engineering background to design, build, and optimize scalable, high-performance systems. The role involves defining and managing SLOs/SLIs, developing CI/CD pipelines with integrated testing and security, and implementing monitoring, alerting, and incident management systems focused on reducing MTTR and MTTM. Responsibilities also include performing Root Cause Analysis, managing deployment strategies like Blue/Green and Canary releases, and optimizing database performance. The SRE will collaborate within Agile Scrum teams to drive continuous delivery improvements.
Job Title: Senior SRE Engineer
We are looking for a highly skilled SRE professional with a strong engineering background to design, build, and optimize scalable, high-performance systems.
🔹 Key Responsibilities:
- Define and manage Service Level Objectives (SLOs) and Service Level Indicators (SLIs)
- Build and maintain scalable CI/CD pipelines with integrated testing and security controls
- Implement monitoring, alerting, and incident management systems (MTTR/MTTM driven)
- Perform Root Cause Analysis (RCA) and drive problem management initiatives
- Manage deployment strategies including Blue/Green and Canary releases
- Optimize database performance, indexing, and query efficiency
- Collaborate within Agile Scrum teams to continuously improve delivery
🔹 Required Skills:
- Strong programming: Python / Java / Go
- Cloud & DevOps: Microsoft Azure, Azure DevOps, Terraform, Jenkins
- Containers: Docker, Kubernetes (AKS preferred)
- Monitoring: Splunk (mandatory), Grafana / Prometheus / ELK / Datadog
- Databases: SQL Server, Oracle, NoSQL (CosmosDB)
- Scripting: PowerShell, Bash
- Tools: Git, SonarQube, Checkmarx
🔹 Good to Have:
- Test automation (Selenium, JMeter, Postman, TestNG)
- Config tools: Chef, Octopus Deploy
- Performance tuning in production environments
Similar roles
- Senior Site Reliability EngineerParallel Domain · Madrid, Comunidad de Madrid, Spain · Remote
- Site Reliability EngineerPacer Group · Montreal, Quebec, Canada · Hybrid
- Senior Site Reliability EngineerBlock Inc · New York, New York, United States · Remote
- Senior Site Reliability EngineerBlock Inc · Bay, California, United States · Remote
- Senior Site Reliability EngineerUplink · United States · Hybrid