Realign Verified
Software, Business Process Management, Enterprise Architecture

AI Engineer / Intelligent Operations (Infrastructure)-6

Toronto, Ontario, Canada | Onsite | Full Time | Posted 2 months ago

Role summary

We are seeking an experienced AI Engineer specializing in Intelligent Operations for Infrastructure. This role involves designing and implementing AI-driven solutions to enhance infrastructure monitoring, automation, and operational efficiency. The engineer will work at the intersection of AI/ML, cloud infrastructure, and DevOps to build intelligent operational systems. Key responsibilities include developing and deploying AI/ML models, automating incident response, integrating AI with cloud platforms, building data pipelines, and optimizing system performance. The role requires strong Python and AI/ML framework experience, knowledge of cloud platforms, Docker, Kubernetes, and DevOps/CI/CD practices.

Toronto, Ontario M5V 3L9 | Posted March 29th, 2026

Job Type: Full Time

Job Category: IT

Job Description

### Role: AI Engineer – Intelligent Operations (Infrastructure)

### Location: Toronto, ON

### Employment Type: Full-Time (FT)

### Work Mode: Onsite

## Job Description:

We are seeking an experienced AI Engineer – Intelligent Operations (Infrastructure) to design and implement AI-driven solutions that enhance infrastructure monitoring, automation, and operational efficiency. The ideal candidate will work at the intersection of AI/ML, cloud infrastructure, and DevOps to build intelligent operational systems.

## Key Responsibilities:

- Develop and deploy AI/ML models for infrastructure monitoring and predictive maintenance
- Automate incident detection, root cause analysis, and remediation workflows
- Integrate AI solutions with cloud and on-prem infrastructure platforms
- Build data pipelines for infrastructure logs and telemetry analysis
- Collaborate with DevOps, SRE, and Cloud teams
- Optimize system performance, scalability, and reliability
- Implement MLOps practices for model deployment and lifecycle management
- Provide technical leadership and documentation

## Required Skills:

- Strong experience in Python and AI/ML frameworks (TensorFlow, PyTorch, Scikit-learn)
- Experience working with infrastructure monitoring data (logs, metrics, traces)
- Knowledge of cloud platforms (AWS, Azure, or GCP)
- Experience with Docker and Kubernetes
- Understanding of DevOps and CI/CD practices
- Strong analytical and problem-solving skills

## Preferred Qualifications:

- Experience in AIOps or Intelligent Automation
- Knowledge of monitoring tools (Splunk, Datadog, Prometheus, etc.)
- Experience with MLOps tools (MLflow, SageMaker, Vertex AI)
- Strong communication and stakeholder collaboration skills

