Research Engineer - Vision Language Action Models for Intelligent Cyber Physical Systems iv.)

Robert Bosch GmbH
Renningen, Germany
4 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English, German

Job location

Renningen, Germany

Tech stack

Artificial Intelligence
Cloud Computing
Continuous Integration
Python
Machine Learning
TensorFlow
Cyber-physical Systems
Reinforcement Learning
PyTorch
Deep Learning
Generative AI
Information Technology
Docker

Job description

  • As a Research Engineer, you will develop cutting-edge Vision-Language-Action (VLA) architectures that empower AI agents to interpret human instructions and act autonomously in complex environments.
  • Furthermore, you will connect multimodal representation learning with long-term control and planning to create agents that go beyond reactive capabilities and exhibit cognitive intelligence.
  • You will contribute to fundamental and applied research on novel VLA models and actively drive their advancement.
  • Building a scalable infrastructure for experimentation, training, and deployment, including the development of training pipelines, simulation tools, and evaluation methods will be part of your task.
  • You will make VLA methods usable for concrete Bosch applications in practice and demonstrate their superior flexibility and generalization capabilities.
  • Last but not least, you will work closely with interdisciplinary teams of researchers and application experts to shape the early innovation phase and the long-term strategy for intelligent automation at Bosch.

Requirements

Do you have a Master's degree?, + excellent MSc in Computer Science, Machine Learning, Robotics or related technical fields

  • PhD in Multimodal AI, Robotics, Reinforcement Learning, or Generative AI preferred
  • demonstrated academic excellence, with a strong publication record in leading AI and robotics conferences and journals (NeurIPS, ICLR, ICML, CVPR, CoRL, RSS, ICRA, ACL, EMNLP, etc.)
  • Experience and Knowledge:
  • Industrial SW development experience: o Demonstrated industry-relevant practical AI experience, e.g. by code contributions in industrial AI applications, in large scale machine learning projects, or participation in large-scale AI benchmarks and contests o Multiple years of experience in developing and deploying machine learning solutions in distributed SW development teams o Demonstrated capability to go beyond research prototypes and integrate cutting-edge AI methods into practically relevant and usable SW solutions

  • Multimodal AI & Vision-Language Models: o Proficiency in designing and training vision-language models (VLMs) and language-multimodal models (LMMs) (e.g., Flamingo, GPT-4V, PaLM-E, RT-2) o Hands-on experience with visual grounding, cross-modal attention, and instruction-following architectures o Familiarity with benchmarks such as ALFRED, Ego4D, VLN, or datasets involving perception and action

  • Control, Reinforcement Learning & Planning: o Strong experience in reinforcement learning, imitation learning, or model-based control for agents operating in real or simulated environments o Capability in designing agents for long-horizon planning, semantic task decomposition, and hierarchical control o Knowledge of methods that combine perception, language, and action in task-driven settings

  • AI for Cyber-Physical Systems & Automation: o Demonstrated capability to integrate cutting-edge AI methods into practical applications in robotics, automated o driving, industrial automation, or building systems interest in emerging domains such as smart heating and HVAC control, where semantic AI can optimize control strategies o Focus on building robust, explainable, and semantically grounded agents for physical deployment

  • Infrastructure, Simulation & Tooling: o Proficient in Python and deep learning libraries such as PyTorch, TensorFlow, or JAX o Experience with simulation platforms such as Isaac Sim, CARLA, MuJoCo, or Habitat o Skilled in developing scalable pipelines using Docker, CI/CD, and multi-GPU/cloud infrastructure o Proven ability to drive research projects and turning research into practical innovation o Collaborative and interdisciplinary mindset, able to work across research and product teams at Bosch

  • Personality and Working Practice: you have proven ability to drive research projects and turning research into practical innovation; collaborative and interdisciplinary mindset, you are able to work across research and product teams at Bosch
  • Languages: fluent English, German is optional

About the company

At Bosch, we shape the future by inventing high-quality technologies and services that spark enthusiasm and enrich people's lives. Our promise to our associates is rock-solid: We grow together, we enjoy our work, and we inspire each other. Welcome to Bosch.

Apply for this position