Patient Life Care logo
Patient Life Care Verified
Healthcare Services, Home Healthcare

AI DevOps/Platform Engineers

CanadaRemoteFull Time$40–$50 /hrPosted 2 months ago

Is this role right for you?

Upload your resume and get a skill-by-skill breakdown — see exactly where you match, where you're close, and what to highlight. Not a mystery percentage.

Get a tailored resume highlighting what this role needs.

Role summary

The AI DevOps/Platform Engineer will be instrumental in building, maintaining, and scaling enterprise AI infrastructure for the AI Enablement team. This role involves developing and operating proprietary agent orchestration platforms (NOVA), managing AI gateway services (LiteLLM), and optimizing Retrieval-Augmented Generation (RAG) pipelines across multi-cloud environments (GCP, Azure). Key responsibilities include deploying services on Kubernetes, implementing CI/CD pipelines, automating infrastructure with tools like Terraform, and ensuring system observability and security. Proficiency in Python/TypeScript and experience with AI/ML platforms are essential for this remote, permanent position.

The AI DevOps/Platform Engineer will join the AI Enablement team, focusing on building, maintaining, and scaling enterprise AI infrastructure. This includes proprietary agent orchestration platforms (NOVA), AI gateway services, and Retrieval-Augmented Generation (RAG) pipelines across multi-cloud environments.

Key Responsibilities:

  • Platform Development & Operations:
  • Develop, deploy, and maintain the NOVA agentic AI platform
  • Manage LiteLLM as the central AI gateway
  • Optimize LLM routing, cost control, load balancing, and failover
  • Implement monitoring and observability (Prometheus, Grafana, OpenTelemetry)
  • RAG Pipeline Development:
  • Design and optimize RAG pipelines
  • Maintain document ingestion, chunking, embeddings, and vector stores
  • Build RAG on GCP and Azure using managed AI services and vector databases
  • Infrastructure & DevOps:
  • Deploy AI services on Kubernetes (AKS, GKE)
  • Implement CI/CD with Jenkins, Opsera, GitHub Actions
  • Automate infrastructure (Terraform, Helm, GitOps)
  • Ensure security and compliance
  • Agentic AI & Automation:
  • Develop automation tools and scripts
  • Build MCP servers for tool integrations
  • Enable multi-agent orchestration and autonomous workflows
  • Create SDKs, APIs, and developer documentation

Required Qualifications:

  • 8+ years platform engineering/DevOps experience
  • 2+ years AI/ML or LLM platform experience
  • Strong Kubernetes, CI/CD, and cloud experience (GCP or Azure)
  • Proficiency in Python and/or TypeScript

Technical Environment:

  • AI Platforms: LiteLLM, LangChain, LangGraph
  • Cloud: GCP, Azure
  • Containers: Kubernetes, Docker, Helm
  • CI/CD: Jenkins, GitHub Actions, Opsera
  • Observability: Prometheus, Grafana, OpenTelemetry, Dynatrace
  • Languages: Python, TypeScript, Bash

Job Type: Permanent

Pay: $40.00-$50.00 per hour

Work Location: Remote

Ready to apply?
You'll be redirected to Patient Life Care's application page.

Similar roles