Pacer Group
Manufacturing, Electrical Equipment, Distribution

Site Reliability Engineer

Montreal, Quebec, Canada · Hybrid · Contract · Posted 1 day ago

7-8 years of experience in SRE, infrastructure, or operations for large-scale systems

Experience supporting IaaS platforms

Experience with infrastructure supporting GenAI applications

Strong programming/scripting skills (Python, Go, Java)

Experience with containerization (Docker) and orchestration (Kubernetes, etc.) tools

Experience with IaC tools (Terraform, Helm, CloudFormation, Ansible, etc.)

Knowledge of GPU / AI compute clusters

Experience with monitoring/alerting tools (Prometheus, Grafana, ELK/EFK, Datadog, etc.)

Networking & systems engineering knowledge (TCP/IP, DNS, routing, load balancing, distributed storage)
