Lead SRE
Role summary
We are seeking a Lead Site Reliability Engineer (SRE) with extensive experience in SRE practices, microservices, Kubernetes, Docker, and AWS Cloud. The ideal candidate will have a strong understanding of application servers (Oracle, IBM, Tomcat), monitoring tools like New Relic, and logging frameworks such as Elastic/OpenSearch, Logstash, and Kibana. Responsibilities include troubleshooting complex issues like JVM failures and JDBC connection leaks, ensuring high availability and performance of critical systems, and understanding business flows, customer experience, KPIs, and SLAs. Experience in the Telecom domain and CI/CD pipelines is highly desirable.
Must have / Required Skills: Site Reliability Engineering Practices. Should be good at understanding Microservices, KUBERNETES, DOCKER, AWS CLOUD, Oracle/IBM/Tomcat application servers, NewRelic Should have good understanding on Business flows, Customer Experience, KPis and SLA''s. Good Understanding on Logging frameworks and tools like Elastic/Open search, Logstash and Kibana. Experience in troubleshooting JVM failures, JDBC connection leaks and service integration failures Experience with Application Monitoring tools like New Relic. Good to have : 12-15+ years of experience in IT Experience working in Telecom Domain In-depth knowledge of configuring, tuning, and maintaining java application servers and micro services on Kubernetes platform Strong understanding of SDLC Experience working on CI/CD pipelines using FlexDeploy, Jenkins, Artifactory etc
For applications and inquiries, contact: hirings@openkyber.com
Similar roles
SRE LeadGemini Solutions Pvt · Toronto, Ontario, Canada · Onsite- Lead SREJobs via Dice · Mckinney, Texas, United States · Hybrid
Senior SREWaystar · Atlanta, Georgia, United States · Onsite
Team Lead, SRELoblaw Companies Limited · Brampton, Ontario, Canada · Onsite- SRECollabera · Baltimore, Maryland, United States · Remote