
Site Reliability Engineer
Role summary
We are seeking a Site Reliability Engineer (SRE) with over 10 years of experience to join our team in Toronto. The role requires hands-on expertise with observability and incident management tools such as Dynatrace, Splunk (ITSI), Moogsoft, and PagerDuty. Proficiency in Python scripting and experience with configuration management tools like Ansible, along with version control systems like Git and GitHub Actions, are essential. A strong understanding of distributed systems, cloud environments, and SRE principles is critical. Familiarity with containerization technologies (Kubernetes, Docker), Red Hat OpenShift, AI/ML-driven observability, AIOps platforms, LLM-based automation, Generative AI, ChatOps frameworks, and event-driven architectures (Kafka, RabbitMQ) is also highly valued.
*Role: SRE Engineer*
*Location: Toronto(Onsite)*
*Exp: 10+yrs*
*Required Skills:*
- Hands-on experience with
Dynatrace, Splunk (ITSI), Moogsoft, PagerDuty
- Strong scripting skills in
Python
- Experience with
Ansible, Git, GitHub Actions
- Solid understanding of
distributed systems, cloud, and SRE principles
- Exposure to
Kubernetes / Docker
environments is a plus
- Experience with
AI/ML-driven observability or AIOps platforms
- Experience with
Red Hat OpenShift
- Exposure to
LLM-based automation / Generative AI
- Familiarity with
ChatOps frameworks
- Knowledge of
event-driven architectures (Kafka, RabbitMQ, etc.)
*Regards*
*Praveen Kumar*
*Talent Acquisition Group – Strategic Recruitment Manager*
***praveen.r@themesoft.com
|
Themesoft Inc***
Similar roles
- Senior Site Reliability EngineerParallel Domain · Madrid, Comunidad de Madrid, Spain · Remote
- Site Reliability EngineerPacer Group · Montreal, Quebec, Canada · Hybrid
- Senior Site Reliability EngineerBlock Inc · New York, New York, United States · Remote
- Senior Site Reliability EngineerBlock Inc · Bay, California, United States · Remote
- Senior Site Reliability EngineerUplink · United States · Hybrid