Research Engineer - Vision Language Action Models for Intelligent Cyber Physical Systems iv.)
Role details
Job location
Tech stack
Job description
- As a Research Engineer, you will develop cutting-edge Vision-Language-Action (VLA) architectures that empower AI agents to interpret human instructions and act autonomously in complex environments.
- Furthermore, you will connect multimodal representation learning with long-term control and planning to create agents that go beyond reactive capabilities and exhibit cognitive intelligence.
- You will contribute to fundamental and applied research on novel VLA models and actively drive their advancement.
- Building a scalable infrastructure for experimentation, training, and deployment, including the development of training pipelines, simulation tools, and evaluation methods will be part of your task.
- You will make VLA methods usable for concrete Bosch applications in practice and demonstrate their superior flexibility and generalization capabilities.
- Last but not least, you will work closely with interdisciplinary teams of researchers and application experts to shape the early innovation phase and the long-term strategy for intelligent automation at Bosch.
Requirements
Do you have a Master's degree?, + excellent MSc in Computer Science, Machine Learning, Robotics or related technical fields
- PhD in Multimodal AI, Robotics, Reinforcement Learning, or Generative AI preferred
- demonstrated academic excellence, with a strong publication record in leading AI and robotics conferences and journals (NeurIPS, ICLR, ICML, CVPR, CoRL, RSS, ICRA, ACL, EMNLP, etc.)
- Experience and Knowledge:
-
Industrial SW development experience: o Demonstrated industry-relevant practical AI experience, e.g. by code contributions in industrial AI applications, in large scale machine learning projects, or participation in large-scale AI benchmarks and contests o Multiple years of experience in developing and deploying machine learning solutions in distributed SW development teams o Demonstrated capability to go beyond research prototypes and integrate cutting-edge AI methods into practically relevant and usable SW solutions
-
Multimodal AI & Vision-Language Models: o Proficiency in designing and training vision-language models (VLMs) and language-multimodal models (LMMs) (e.g., Flamingo, GPT-4V, PaLM-E, RT-2) o Hands-on experience with visual grounding, cross-modal attention, and instruction-following architectures o Familiarity with benchmarks such as ALFRED, Ego4D, VLN, or datasets involving perception and action
-
Control, Reinforcement Learning & Planning: o Strong experience in reinforcement learning, imitation learning, or model-based control for agents operating in real or simulated environments o Capability in designing agents for long-horizon planning, semantic task decomposition, and hierarchical control o Knowledge of methods that combine perception, language, and action in task-driven settings
-
AI for Cyber-Physical Systems & Automation: o Demonstrated capability to integrate cutting-edge AI methods into practical applications in robotics, automated o driving, industrial automation, or building systems interest in emerging domains such as smart heating and HVAC control, where semantic AI can optimize control strategies o Focus on building robust, explainable, and semantically grounded agents for physical deployment
-
Infrastructure, Simulation & Tooling: o Proficient in Python and deep learning libraries such as PyTorch, TensorFlow, or JAX o Experience with simulation platforms such as Isaac Sim, CARLA, MuJoCo, or Habitat o Skilled in developing scalable pipelines using Docker, CI/CD, and multi-GPU/cloud infrastructure o Proven ability to drive research projects and turning research into practical innovation o Collaborative and interdisciplinary mindset, able to work across research and product teams at Bosch
- Personality and Working Practice: you have proven ability to drive research projects and turning research into practical innovation; collaborative and interdisciplinary mindset, you are able to work across research and product teams at Bosch
- Languages: fluent English, German is optional