DevOps Engineer - Canada Wide - Remote
Role summary
Newton is seeking a remote DevOps Engineer to enhance their systems for building, deploying, and running services. This role involves working with AWS infrastructure, CI/CD pipelines, observability tools, and operational tooling across backend, frontend, and internal services. Responsibilities include maintaining and scaling infrastructure, improving monitoring and alerting, automating operational tasks, and supporting production systems. The ideal candidate has experience with cloud environments like AWS, CI/CD, infrastructure automation, containerization (Docker, ECS, Kubernetes), IaC tools (Terraform, Pulumi), observability tools (Datadog, Prometheus, Grafana), and programming languages like Python and Go. Experience with incident response and troubleshooting across application, infrastructure, and data layers is crucial.
Some of our values:
- Customer-first mindset: a commitment to integrity and transparency with our users.
- A dynamic team fueled by collaboration, uniting our strengths to overcome obstacles. Together we build success. We persevere, adapt, and come back stronger, turning obstacles into opportunities.
- We strive for continuous improvement, embrace creativity, and encourage experimentation. We push the boundaries of what’s possible and continuously explore new ideas, technologies, and solutions.
Role Overview:
We are searching for a DevOps Engineer to improve how we build, deploy and run our systems. This role works across infrastructure, CI/CD, observability and operational tooling in an AWS-based environment spanning backend, frontend and internal services.
Who you are:
- Experience running production systems in AWS or a similar cloud environment
- Experience with CI/CD and infrastructure automation
- Strong understanding of AWS networking, including VPCs, subnets, route tables, security groups, load balancers, DNS and connectivity between services
- Comfort with Linux, shell scripting, Python, and Go
- Experience with Docker and ECS or Kubernetes
- Experience with GitHub Actions, Pulumi, Terraform, or similar tooling
- Experience with Datadog, Prometheus, Grafana, or similar observability tools
- Good understanding of PostgreSQL, Redis, queues, async workers, and scheduled jobs
- Familiarity with Cloudflare or similar edge, networking or traffic management tooling
- A practical approach to automation, reliability and day to day operational work
- Experience with on-call and incident response for business-critical systems
- Strong troubleshooting skills across application, infrastructure, and data layers
Originally posted on Himalayas