Senior Site Reliability Engineer
Role summary
Seeking a Senior Site Reliability Engineer / Cloud SME for a long-term contract role. This 100% remote position within the US requires a technical leader to drive the rearchitecting of a monolithic system into a resilient, cloud-native architecture using AWS, Azure, and Kubernetes. Responsibilities include leading transformation efforts, implementing robust cloud-native components, developing monitoring and alerting, and driving DevOps practices. The role also involves mentoring junior team members and staying current with emerging technologies. A Bachelor's or Master's degree in a related field is required.
U.S. citizenship required / No clearance needed / 100% remote within the US / CST time zone
Senior Site Reliability Engineer / Cloud SME
Location:
100% remote in the continental US
Type:
Long-term contract (3+ years)
Role Summary
As the Senior SRE/Cloud SME, you will be a critical technical leader driving the rearchitecting of our existing monolithic system into a resilient, cloud-native architecture. This role requires deep expertise across multiple cloud platforms (Azure and AWS) and container orchestration (Kubernetes) to ensure the next-generation platform meets the highest standards of scalability, reliability, and security.
Key Responsibilities
Architecture & Transformation Leadership
- Lead the technical rearchitecting efforts, transforming a large-scale monolithic system into a modern microservices-based, cloud-native application.
- Collaborate with cross-functional teams (Engineering, Architecture, Product) to define and implement the new system architecture using domain-driven design (DDD) principles.
- Conduct technology evaluations and provide recommendations for new tools, frameworks, and cloud services to enhance our infrastructure.
Reliability Engineering & Cloud Operations
- Utilize Kubernetes (K8S) for container orchestration and management, ensuring extreme scalability, reliability, and high availability of the system.
- Implement robust, highly resilient, and highly available components for the system.
- Develop and implement comprehensive monitoring, logging, and alerting mechanisms to ensure optimal system performance and availability.
- Drive the adoption of DevOps principles and practices throughout the software development lifecycle, ensuring seamless integration and continuous deployment processes.
Technical Expertise & Mentorship
- Stay up to date with emerging technologies, frameworks, and industry trends related to systems and cloud computing.
- Mentor and provide technical guidance to junior team members, fostering a culture of continuous learning and professional growth.
Required Qualifications
- Cloud Platforms: 7+ years of experience with cloud computing platforms. Strong multi-cloud expertise with AWS and Azure is required.
- Cloud-Native Transformation: 7+ years of experience in rearchitecting large-scale monolithic applications to cloud-native architectures.
- Container Orchestration: Strong expertise in Kubernetes (K8S) is required, including hands-on experience with both AKS (Azure Kubernetes Service) and EKS (Elastic Kubernetes Service).
- Networking: Strong experience with cloud networking, with the ability to design complex cloud networking architectures and resolve problems in them.
- IaC: Expert knowledge of Terraform for infrastructure-as-code deployment and management.
- Security: Must possess strong knowledge of security best practices for containers and Kubernetes clusters.
- Education: Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
- Bonus Knowledge: Knowledge of load balancing algorithms.