Expert DevOps Engineer for Sovereign Cloud Onsite Project
Role details
Job location
Tech stack
Job description
Build enterprise cloud infrastructure that provides European data sovereignty and hyperscaler-grade capabilities. You'll work on SAP Cloud Infrastructure, solving complex distributed systems challenges at scale: multi-region networking, container orchestration, storage systems, and the APIs that connect them.
You'll develop solutions using Go, OpenStack, and Kubernetes, tackling problems like: How do you auto-scale thousands of containers across regions? How do you build resilient storage systems? How do you design APIs that handle massive traffic spikes?
Your work will power SAP's production systems and thousands of customer environments. You'll contribute to infrastructure that enables organizations to run mission-critical applications with the performance and reliability they expect from leading cloud platforms.
In your role as DevOps Engineer, you'll build and maintain the infrastructure automation that powers SAP Cloud Infrastructure, implementing CI/CD pipelines and deployment automation for enterprise cloud services. You'll ensure reliable delivery of distributed systems components that handle massive scale across multiple regions.
You'll work with Kubernetes, Terraform, and cloud-native tooling to automate the deployment and operation of container orchestration platforms, networking systems, and storage solutions. Your focus will be on creating robust automation that supports multi-region deployments, handles traffic spikes gracefully, and maintains the high availability standards expected by enterprise customers.
Contributing to production systems that serve thousands of customer environments, you'll build automation solutions that enable SAP Cloud Infrastructure to compete with leading cloud platforms. Your work will directly support European organizations in running mission-critical applications while maintaining full control and data sovereignty., For SAP employees: Only permanent roles are eligible for the SAP Employee Referral Program, according to the eligibility rules set in the SAP Referral Policy. Specific conditions may apply for roles in Vocational Training. AI Usage in the Recruitment Process For information on the responsible use of AI in our recruitment process, please refer to our Guidelines for Ethical Usage of AI in the Recruiting Process. Please note that any violation of these guidelines may result in disqualification from the hiring process. Requisition ID: 441505 | Work Area: Software-Design and Development | Expected Travel: 0 - 10% | Career Status: Professional | Employment Type: Regular Full Time | Additional Locations: or Garching or Dresden or St. Leon Rot #LI-Hybrid #MultiSalesDE #SAP-EUCloudAICareers
Requirements
Do you have experience in Terraform?, * Kubernetes Platform Mastery: Deep hands-on experience operating Kubernetes clusters and managing workloads, upgrades, and platform components such as ingress, service meshes, and cert-manager. CKA, CKAD, or CKS certification are a strong plus. Expert-level knowledge of OpenStack, multi-cloud environments with advanced container orchestration and multi-cluster federation
-
Advanced Cloud & Infrastructure Expertise: Mastery of cloud platforms (AWS, Azure, GCP) and advanced Kubernetes management, including multi-cluster operations, scaling, and monitoring
-
Automation Proficiency: Experience with Infrastructure as Code tools (e.g. Terraform, Ansible) and designing end-to-end CI/CD pipelines
-
Infrastructure Mastery: Deep experience with Terraform, Pulumi, and building self-service developer platforms at hyperscaler grade
-
Advanced Scripting Capabilities: Proficient in Go, Python or Bash, or other languages for building complex automation workflows
-
Monitoring & Observability: Advanced skills in Prometheus, Grafana, OpenTelemetry, and designing SLI/SLO frameworks for distributed systems
-
Platform Engineering Experience: 5-8 years in DevOps or infrastructure roles, with demonstrated ability to design, operate, and evolve Kubernetes-based platforms at scale. Experience with proven ability to lead teams and drive infrastructure initiatives
-
Innovative Problem-Solving: Proven ability to diagnose and resolve complex distributed system issues using metrics and tracing tools like Prometheus, Grafana, and Loki; skilled in performance tuning, incident response, and automating operational resilience
-
Collaboration & Mentorship: Strong track record leading cross-functional teams, mentoring junior engineers, and driving organizational adoption of best practices
-
Strategic Vision: Experience developing infrastructure roadmaps, technology strategy, and managing thousands of workloads in multi-region environments