DevOps Engineer AI, Madrid
Role details
Job location
Tech stack
Job description
Tech Team, you will play a key role in enabling the design, development, and deployment of scalable and resilient AI solutions. Your mission will be to build and maintain the infrastructure and automation pipelines that support the entire AI lifecycle-from experimentation to production-ensuring agility, security, and operational excellence. You will work in close collaboration with AI Experts, Data Scientists, ML Engineers, and enterprise technology partners to accelerate the delivery of AI products that drive business value across Santander. This is a critical role in the AI transformation agenda, empowering the team to deliver responsible AI at scale. Key Responsibilities: - Design, implement, and manage robust CI/CD pipelines for AI/ML models and GenAI applications - Build and maintain cloud-native infrastructure (AWS, Azure, or GCP) to support the full AI lifecycle: from data ingestion and model training to production deployment and monitoring - Automate infrastructure provisioning
Requirements
using Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Pulumi - Ensure scalability, reliability, and high availability of AI services in production environments - Implement and enforce best practices in MLOps, including model versioning, monitoring, rollback, and automated testing - Collaborate with data engineering and ML teams to operationalize AI models and integrate them into business-critical systems - Monitor system performance, debug issues, and lead root-cause analysis and resolution of incidents - Champion security, compliance, and governance standards across the AI tech stack - Contribute to the continuous improvement of DevOps capabilities and tooling within the AI Tech Team - Act as a technical advisor in DevOps and MLOps practices, fostering a culture of automation and engineering excellence Required Experience: - 5+ years of experience in DevOps, Site Reliability Engineering, or Platform Engineering roles - Proven experience supporting ML/AI workloads in production environments - Hands-on experience with containerization (Docker, Kubernetes) and orchestration of microservices - Strong background in managing cloud environments (AWS, Azure, or GCP), including cost optimization and security best practices - Solid experience implementing CI/CD pipelines and using tools such as Jenkins, GitHub Actions, GitLab CI, or similar - Familiarity with machine learning workflows, model deployment patterns, and MLOps tools (e.g., MLflow, Kubeflow, SageMaker, Vertex AI) Education: - BSc or MSc in Computer Science, Engineering, or a related technical field - Relevant certifications in cloud platforms or DevOps practices (e.g., AWS DevOps Engineer, Azure DevOps, Google Cloud DevOps) are a plus Skills Competencies: - Strong scripting and automation skills (e.g., Python, Bash, Go) - Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack, Datadog) - Deep underst