MLOps Engineer (Python) - AI Platform
Job description
Write production-grade Python - services, pipelines, shared libraries, internal tooling
Own containerized deployments end to end - Docker, CI/CD, versioning, runtime management
Keep AI agents running at scale - high availability, performance, resilience under load
Build observability into everything - logging, tracing, monitoring, alerting
Manage cloud infrastructure with Terraform across Azure and GCP
Deploy and maintain workflow orchestration with Prefect
Troubleshoot production issues across the full stack - app to infra
Requirements
3-5 years of experience in software engineering, DevOps, or MLOps
Python is your primary language - and you've shipped it in production
Docker and containerized cloud deployments are second nature
You've used Terraform in a real environment, not just tutorials
You think in systems - you debug end to end, not just your layer
CI/CD and Git-based release workflows are part of your daily life