DevOps Manager

Esw.
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Azure
Cloud Computing
Cloud Computing Security
Cloud Engineering
Continuous Integration
DevOps
Identity and Access Management
Systems Development Life Cycle
Role-Based Access Control
Site Reliability Engineering Practices
Prometheus
Datadog
Google Cloud Platform
Cloud Monitoring
System Availability
Grafana
Containerization
Kubernetes
Deployment Automation
Bicep
Terraform

Job description

We are seeking an experienced and strategic Senior Manager - DevOps & SRE to lead and evolve our reliability and platform engineering capabilities across our global eCommerce ecosystem.

This role goes beyond traditional DevOps management. You will be responsible for defining and driving our reliability strategy, embedding SRE principles (SLIs, SLOs, error budgets), and ensuring our platforms operate at scale with high availability, performance, and resilience.

You will lead a distributed team of DevOps and SRE engineers, working closely with Engineering, Product, Security, and Architecture to enable reliable, scalable, and automated cloud-native systems across Azure, AWS, and GCP., Leadership & Organizational Impact Lead, mentor, and grow a high-performing DevOps & SRE function. Define clear ownership models, reliability standards, and ways of working. Elevate engineering maturity through automation, observability, and operational excellence. Drive accountability and promote a culture of reliability, learning, and continuous improvement. Partner with senior stakeholders to align platform reliability with business objectives. Reliability Strategy & SRE Practices

Define and implement SRE best practices (SLIs, SLOs, error budgets). Own incident management strategy, postmortems, and systemic improvements. Improve resilience through proactive risk identification and mitigation. Establish measurable reliability KPIs aligned with customer experience. Cloud Infrastructure & Platform Engineering

Oversee cloud operations primarily in Microsoft Azure, with exposure to AWS and GCP. Ensure infrastructure is scalable, secure, and cost-efficient. Drive Infrastructure as Code adoption (Terraform, Bicep/ARM). Define platform standards for Kubernetes and containerized environments. Automation, CI/CD & Developer Enablement

Champion CI/CD best practices and release reliability. Improve deployment strategies (blue/green, canary releases). Reduce operational toil through automation and self-healing systems. Support high-traffic eCommerce events and critical production workloads. Observability & Operational Excellence

Define and evolve our observability strategy (Azure Monitor, Grafana, Datadog, Prometheus, etc.). Improve signal-to-noise ratio in monitoring and alerting. Drive root cause analysis discipline and continuous improvement loops. Explore AI-assisted operations for incident detection, alert optimization, and operational efficiency. Security & Compliance

Ensure secure cloud practices (IAM, least privilege, data protection). Partner with Security to enforce compliance and governance standards. Embed security and reliability into the full SDLC lifecycle. Key Experience & Skills

Requirements

10+ years of experience in DevOps, SRE, or Platform Engineering roles. 3+ years leading and scaling technical teams. Strong hands-on background in Microsoft Azure (required) AWS (required) and GCP (nice to have). Deep understanding of cloud-native architectures and Kubernetes. Proven experience implementing SRE frameworks (SLAs, SLOs, incident management). Strong experience with Infrastructure as Code (Terraform, ARM/Bicep). Observability expertise (Grafana, Datadog, Prometheus, Azure Monitor). Experience managing production systems at scale (high-traffic environments preferred). Strong stakeholder management and communication skills. Strategic mindset with the ability to balance technical depth and business impact. Nice to Have

Experience in global eCommerce platforms. Experience leading cloud transformation initiatives. Exposure to AI-driven operational tooling. Relevant certifications (Azure, Kubernetes, Cloud Architecture).

Benefits & conditions

Competitive salary and benefits: Your financial well-being is important to us. Join ESW and experience the satisfaction of being rewarded for your hard work, dedication, and commitment.

About the company

International environment: Work with people from over 30 different cultures and get the chance to use English daily. Professional and personal development: We will ensure your talent is nurtured and cultivated for growth and success throughout your career with ESW. Hybrid working: Enjoy the best of both worlds with 2-3 days in our office in Méndez Alvaro, and 2-3 days working from the comfort of your home. Diversity, Belonging & Inclusion: When we win, we win together. You'll be part of a culture that values every individual for who they are, fostering an environment where uniqueness is encouraged. ESW is an equal opportunity employer, and we're proud of our ongoing efforts to foster diversity, equity, & inclusion in the workplace. Individuals seeking employment and employees at ESW are considered without regard to race, color, religion, national origin, age, sex, gender, gender identity, gender expression, sexual orientation, marital status, medical condition, ancestry, disability, military or veteran status, or any other characteristic protected by applicable law. If you require any reasonable accommodations or adjustments throughout the hiring process, please let us know. We are dedicated to ensuring equal access and opportunity for all candidates. #LI-hybrid #LI-TS1 False FULL_TIME Organization Esw Unknown false Place PostalAddress España España España GeoCoordinates

Apply for this position