Senior AI Infrastructure Engineer

BMW AG
München, Germany
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

München, Germany

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Azure
Nvidia CUDA
Continuous Integration
Distributed Systems
InfiniBand
Python
Machine Learning
Performance Tuning
Ansible
AI Infrastructure
Cloud Platform System
Kubernetes
Infrastructure Automation Frameworks
Information Technology
Slurm
Hardware Infrastructure
Terraform
Docker

Job description

Eine innovative Unternehmenskultur in einem so vielschichtigen Konzern wie der BMW Group lebt von komplexen Systemen und Netzwerken. Mit guten Ideen, Begeisterung und Teamgeist entwickeln unsere IT-Exper:innen unverwechselbar smarte und moderne Systeme. Dabei profitieren sie von ausreichend Budgets, aber auch von standardisierten Prozessen, um Lösungen effizienter umzusetzen. So kann eine IT realisiert werden, die neue Möglichkeiten schafft und damit die Basis unserer Unternehmenskultur und unseres Erfolges sichert. We shape the future of domain-specific AI systems at the BMW Group by designing, training, and operating new foundation models. Our team sets standards for safe and scalable AI in engineering and production. What awaits you?

  • You will design, build, and operate GPU-centric AI infrastructure (especially NVIDIA) across on-prem and cloud environments, with a strong focus on performance, scalability, and efficiency.
  • As part of your role, you take ownership of the architecture and operation of high-performance compute environments for distributed training and optimised model execution.
  • By optimizing compute, storage, and high-performance networking (e.g., InfiniBand, NCCL), you enable large-scale AI workloads in industrial contexts.
  • You are responsible for developing and operating core infrastructure components such as scheduling and resource management systems (e.g., SLURM, Ray, Run:ai), ensuring efficient utilization of shared GPU resources.
  • Using modern tooling, you build and maintain automated, reproducible infrastructure (e.g., Docker, Kubernetes, Terraform, Ansible, CI/CD).
  • You contribute to BMW-specific AI use cases by providing reliable and scalable infrastructure.
  • Your role is rounded by taking technical ownership of the AI infrastructure stack, defining best practices, and guide less experienced engineers.

Requirements

  • University degree in Computer Science, Computer/Electrical Engineering or related subjects.
  • Several years of professional experience (8-10 years) in industry, building and operating AI and HPC infrastructure.
  • Strong hands-on experience with GPU systems (especially NVIDIA), including drivers, CUDA, and performance optimisation.
  • Experience with distributed systems and high-performance networking (e.g. InfiniBand, NCCL), combined with experience in cloud environments (AWS, Azure) alongside on-prem infrastructure.
  • Practical experience with resource scheduling and workload orchestration (e.g., SLURM, Ray, NVIDIA Run:ai).
  • Strong experience in infrastructure automation (e.g., Docker, Kubernetes, Terraform, Ansible, CI/CD) and proficiency in Python for infrastructure and system-level tooling.
  • Experience with training, fine-tuning, or serving ML models in production as well as exposure to large-scale industrial AI use cases (e.g. simulation, robotics, engineering) are nice to have.

Benefits & conditions

Note: Please apply exclusively online via our career portal. Applications through other channels (especially email) cannot be considered. What do we offer?

  • Challenging projects with which we shape the mobility of tomorrow together.
  • Wide range of personal and professional development opportunities.
  • Attractive, fair and performance-related remuneration.
  • High level of job security.
  • Annual special payments such as vacation pay, Christmas bonus, and profit sharing.
  • Flexible working hours including six weeks annual leave and overtime compensation.
  • Discounted BMW & MINI conditions.
  • Many other benefits at

Earliest starting date: from now on Type of employment: unlimited Working hours: full-time If you apply, the next steps in the selection process include an online test and subsequent interviews with the hiring manager (either virtually or in person). You can find helpful tips on your application and the application process . At the BMW Group, we place great importance on equal treatment and equal opportunities. Our recruiting decisions are based on the personality, experience, and skills of the applicants. Learn more .

Apply for this position