Senior DevOps Engineer for Sovereign Cloud Onsite / ApeiroRA / EU AI Projects
Job description
At SAP, we keep it simple: you bring your best to us, and we'll bring out the best in you. We're builders touching over 20 industries and 80% of global commerce, and we need your unique talents to help shape what's next. The work is challenging - but it matters. You'll find a place where you can be yourself, prioritize your wellbeing, and truly belong. What's in it for you? Constant learning, skill growth, great benefits, and a team that wants you to grow and succeed.
Location requirement: This position requires the candidate to be physically present in the Berlin, Garching, Dresden, or St. Leon-Rot office. Please ensure you meet this location requirement before applying. SAP supports relocation for this position and will assist successful candidates with the moving process.
What you'll do
Build enterprise cloud infrastructure that provides European data sovereignty and hyperscaler-grade capabilities. You'll work on SAP Cloud Infrastructure, solving complex distributed systems challenges at scale: multi-region networking, container orchestration, storage systems, and the APIs that connect them.
You will contribute to building a sovereign, privacy-first AI cloud for the European market that makes it easy to securely train, fine-tune, and deploy foundation and domain models with strict data residency and isolation. You'll develop solutions using Go, OpenStack, and Kubernetes, tackling problems like: How do you auto-scale thousands of containers across regions? How do you build resilient storage systems? How do you design APIs that handle massive traffic spikes? Your work will power SAP's production systems and thousands of customer environments. You'll contribute to infrastructure that enables organizations to run mission-critical applications with the performance and reliability they expect from leading cloud platforms.
In your role as DevOps Engineer, you'll build and maintain the infrastructure automation that powers SAP Cloud Infrastructure, implementing CI/CD pipelines and deployment automation for enterprise cloud services. You'll ensure reliable delivery of distributed systems components that handle massive scale across multiple regions. You'll work with Kubernetes, Terraform, and cloud-native tooling to automate the deployment and operation of container orchestration platforms, networking systems, and storage solutions. Your focus will be on creating robust automation that supports multi-region deployments, handles traffic spikes gracefully, and maintains the high availability standards expected by enterprise customers. Contributing to production systems that serve thousands of customer environments, you'll build automation solutions that enable SAP Cloud Infrastructure to compete with leading cloud platforms. Your work will directly support European organizations in running mission-critical applications while maintaining full control and data sovereignty.
For SAP employees: Only permanent roles are eligible for the SAP Employee Referral Program, according to the eligibility rules set in the SAP Referral Policy. Specific conditions may apply for roles in Vocational Training.
Qualified applicants will receive consideration for employment without regard to their age, race, religion, national origin, ethnicity, gender (including pregnancy, childbirth, et al.), sexual orientation, gender identity or expression, protected veteran status, or disability, in compliance with applicable federal, state, and local legal requirements. Successful candidates might be required to undergo a background verification with an external vendor.
AI Usage in the Recruitment Process
For information on the responsible use of AI in our recruitment process, please refer to our Guidelines for Ethical Usage of AI in the Recruiting Process.
Please note that any violation of these guidelines may result in disqualification from the hiring process.
Requirements
- Kubernetes Platform Mastery: Deep hands-on experience operating Kubernetes clusters and managing workloads, upgrades, and platform components such as ingress, service meshes, and cert-manager. CKA, CKAD, or CKS certification is a strong plus. Expert-level knowledge of OpenStack and multi-cloud environments, including advanced container orchestration and multi-cluster federation
- Advanced Cloud & Infrastructure Expertise: Mastery of cloud platforms (AWS, Azure, GCP) and advanced Kubernetes management, including multi-cluster operations, scaling, and monitoring
- Automation Proficiency: Experience with Infrastructure as Code tools (e.g. Terraform, Ansible) and designing end-to-end CI/CD pipelines
- Infrastructure Mastery: Deep experience with Terraform, Pulumi, and building self-service developer platforms at hyperscaler grade
- Advanced Scripting Capabilities: Proficient in Go, Python, Bash, or other languages for building complex automation workflows
- Monitoring & Observability: Advanced skills in Prometheus, Grafana, OpenTelemetry, and designing SLI/SLO frameworks for distributed systems
- Platform Engineering Experience: 5-8 years in DevOps or infrastructure roles, with demonstrated ability to design, operate, and evolve Kubernetes-based platforms at scale. Proven ability to lead teams and drive infrastructure initiatives
- Innovative Problem-Solving: Proven ability to diagnose and resolve complex distributed system issues using observability tools for metrics, logs, and dashboards, such as Prometheus, Loki, and Grafana; skilled in performance tuning, incident response, and automating operational resilience
- Collaboration & Mentorship: Strong track record leading cross-functional teams, mentoring junior engineers, and driving organizational adoption of best practices
- Strategic Vision: Experience developing infrastructure roadmaps, technology strategy, and managing thousands of workloads in multi-region environments
- Security and Compliance: Awareness of industry security and compliance standards, and experience integrating regulatory requirements into architectural patterns
- Cultural Adaptability: Experience working across cultures and languages with strong communication skills for diverse, distributed teams
- Soft Skills: Excellent presentation, communication, and problem-solving skills; a thorough and accurate way of working; proactive initiative; an independent and structured working style; a high level of commitment and reliability; readiness to work in a constantly changing, fluid environment; and enjoyment of working in a multinational team