University degree in Computer Science, Engineering, or a related field.
3-5 years of experience in big data or data engineering with a focus on analytics and search over very large datasets.
Strong Python and SQL skills, with experience in at least one distributed compute framework.
Experience with columnar or analytical storage and query optimisation at PB scale.
Familiarity with search and indexing technologies, including full-text search, vector/embedding search or metadata catalogues.
Production experience with Kubernetes and AWS / Azure / Google Cloud, as well as hands-on experience with infrastructure-as-code.
Experience with automotive measurement data (MDF4/ASAM MDF or MCAP) as well as with embedding-based retrieval, dataset management tools, stream processing, or graph-based metadata systems.
Benefits & conditions
Are you ready to shape the future of data-driven mobility with us? Apply now!Note: Please apply exclusively online via our career portal. Applications through other channels (especially email) cannot be considered.What do we offer?
Challenging projects with which we shape the mobility of tomorrow together.
Wide range of personal and professional development opportunities.
Attractive, fair and performance-related remuneration.
High level of job security.
Annual special payments such as vacation pay, Christmas bonus, and profit sharing.
Flexible working hours including six weeks annual leave and overtime compensation.
Discounted BMW & MINI conditions.
Many other benefits at
Earliest starting date: from now onType of employment: unlimitedWorking hours: full-timeIf you apply, the next stages of the recruiting process include an online test and then technical interviews with the hiring manager (either virtual or in person).You can find helpful tips on your application and the application process .At the BMW Group, we place great importance on equal treatment and equal opportunities. Our recruiting decisions are based on the personality, experience, and skills of the applicants. Learn more .
About the company
BMW Alle Jobs anzeigen
* München, Bayern
* Unbefristet
* Vollzeit
* 2 Stunden her
INNOVATION IS IMAGINING WHAT NO ONE ELSE CAN.At the BMW Group, everything begins with passion. It transforms a profession into a vocation. It drives us to continually reinvent mobility and bring innovative ideas to the roads. Enthusiasm for collaborative projects turns a team into a strong unit where every opinion is valued. It is only when expertise, highly professional processes, and enjoyment of work come together that we can shape the future collectively.We train models on petabyte-scale automotive sensor data, but training is only half the story. Before a single GPU cycle is spent, engineers need to find, filter, evaluate, and understand the data. We build the analytics and search infrastructure that makes petabytes of measurements and recordings queryable in seconds, enabling rapid dataset assembly, quality analysis, and model evaluation at scale.What awaits you?
* You design and build high-performance search and query pipelines over PB-scale MDF4 and MCAP data lakes, enabling ML engineers to find relevant driving scenarios, sensor conditions, and edge cases across billions of records in seconds.
* Furthermore, you build and operate indexing and cataloguing systems for automotive sensor data, including metadata extraction, signal-level indexing, scene tagging, and embedding-based similarity search.
* You implement distributed compute pipelines for large-scale data evaluation, such as batch statistics, distribution analysis, annotation coverage reports, and data-quality scoring.
* In addition, you build fast analytical queries that enable interactive exploration on top of raw data.
* You develop dataset assembly pipelines that automatically assemble, version, and register training and evaluation datasets.
* You optimise for cost and performance through intelligent partitioning, tiered storage, caching strategies, and query pushdown to minimise scan volumes over PB-scale data.
* You operate observability stacks for data pipelines, including query latency dashboards, pipeline health, and data freshness monitors.
Apply for this position
Good distractions
Talks and stories from around this role — technically off-topic, practically not.
Moments
05:56 MIN
Why a software leader joined the automotive sector
Inside Mercedes-Benz: How CIO Katrin Lehmann is Empowering 5,000 Developers and Driving Digital Change
15:03 MIN
What Bosch looks for in engineering talent
The future of automotive mobility: Upcoming E/E architectures, V2X and its challenges
02:27 MIN
Building an AI talent hub outside major tech capitals
When worlds collide: How will generative AI change the way we design and build software
06:13 MIN
Skills and challenges of working with automotive AI
Developing an AI.SDK
01:26 MIN
Recognizing Germany's vibrant AI startup ecosystem
Pioneering AI Assistants in Banking
06:35 MIN
From economics student to hands-on technologist at SAP
Inside Mercedes-Benz: How CIO Katrin Lehmann is Empowering 5,000 Developers and Driving Digital Change
02:06 MIN
Applying machine learning in the automotive industry
Getting Started with Machine Learning
02:20 MIN
The evolving role of the machine learning engineer