DevOps Engineer
Role summary
We are seeking a DevOps Engineer to enhance our systems' build, deployment, and operational processes. This role is crucial for improving our AWS-based infrastructure, CI/CD pipelines, observability, and operational tooling across backend, frontend, and internal services. You will be responsible for building, scaling, and maintaining infrastructure, enhancing monitoring and alerting, and automating repetitive tasks. The position involves supporting production systems, participating in on-call rotations, leading incident response, and driving improvements in reliability and resilience. Experience with cloud environments, containerization, IaC, and observability tools is essential.
### Who you are
- Experience running production systems in AWS or a similar cloud environment
- Experience with CI/CD and infrastructure automation
- Strong understanding of AWS networking, including VPCs, subnets, route tables, security groups, load balancers, DNS and connectivity between services
- Comfort with Linux, shell scripting, Python, and Go
- Experience with Docker and ECS or Kubernetes
- Experience with GitHub Actions, Pulumi, Terraform, or similar tooling
- Experience with Datadog, Prometheus, Grafana, or similar observability tools
- Good understanding of PostgreSQL, Redis, queues, async workers, and scheduled jobs
- Familiarity with Cloudflare or similar edge, networking or traffic management tooling
- A practical approach to automation, reliability and day to day operational work
- Experience with on-call and incident response for business-critical systems
- Strong troubleshooting skills across application, infrastructure, and data layers
### What the job involves
- We are searching for a DevOps Engineer to improve how we build, deploy and run our systems
- This role works across infrastructure, CI/CD, observability and operational tooling in an AWS-based environment spanning backend, frontend and internal services
- Improve and maintain CI/CD, deployment workflows, and environment management across backend, web, and internal services
- Build, maintain and scale infrastructure across AWS and container based services
- Improve monitoring, alerting, logging, dashboards, tracing, and runbooks
- Work with engineers on safer deploys, rollback plans, and recovery from failures
- Automate repetitive operational work and improve internal tooling
- Maintain and improve infrastructure as code and deployment tooling
- Help improve failover planning, recovery procedures, and backup/restore testing for critical systems
- Support production systems and take part in on-call for critical services
- Manage and scale infrastructure across AWS, ECS, Docker, PostgreSQL, Redis, Celery, and Go/Python-based services
- Lead incident response and postmortems, and drive follow-up actions to reduce repeat issues
- Improve reliability, resilience, and operational readiness across critical systems
Similar roles
DevOps EngineerBooz Allen Hamilton · Camp Pendleton South, California, United States · Hybrid- DevOps EngineerAxiom Global Technologies · Toronto, Ontario, Canada · Onsite
- Senior DevOps EngineerRegard · New York, New York, United States · Onsite
- Senior DevOps EngineerZoomInfo · Toronto, Ontario, Canada · Hybrid
DevOps EngineerSchellman · Tampa, Florida, United States · Remote