Senior Machine Learning Engineer - LLM & Reinforcement Learning

Warmwind
Jena, Germany
4 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
€ 240K

Job location

Jena, Germany

Tech stack

Artificial Intelligence
Artificial Neural Networks
Nvidia CUDA
InfiniBand
Machine Learning
Parallel Computing
TensorFlow
Reinforcement Learning
Graphics Processing Unit (GPU)
PyTorch
Large Language Models
Workday

Job description

As a Senior Machine Learning Engineer at warmwind, you will push the limits of AI by designing, training, and scaling state-of-the-art Large Language Models (LLMs) and advanced reinforcement learning (RL) systems. We are looking for an exceptional expert who has already built and deployed large-scale LLMs and understands every detail of the process-from tokenization to training at scale.

Your work will drive our next-gen AI models, shaping the future of machine intelligence beyond traditional paradigms. You will work with massive compute clusters (500+ H100 GPUs) and cutting-edge reinforcement learning techniques to create highly efficient, scalable, and groundbreaking AI systems.

Responsibilities

  • Design, train, and optimize Large Language Models (LLMs) from scratch
  • Scale distributed training on massive GPU clusters (500+ H100 GPUs)
  • Implement advanced reinforcement learning techniques (RLHF, adversarial self-play, real-time control)
  • Develop high-performance architectures for multi-modal AI systems
  • Build simulation environments for RL-based AI agents
  • Optimize inference speed and efficiency for real-world deployment
  • Collaborate with top AI researchers to push the boundaries of machine learning innovation

Your Profile Must-haves

  • Deep expertise in LLMs - you've built and trained large-scale models yourself
  • Experience with large-scale distributed training on 500+ GPU superclusters
  • Deep understanding of reinforcement learning, neural network optimization, and self-play methods
  • Expert in PyTorch, TensorFlow, JAX & low-level optimization techniques (CUDA, Triton, DeepSpeed, etc.)
  • Familiarity with high-performance computing (HPC, NVLink, InfiniBand, parallel computing)
  • Strong publication track record in AI/ML research is a plus
  • Relocation to Jena, Germany after initial onboarding

Company Culture and Work Style

We operate in a dynamic startup environment where speed, efficiency, and innovation are key to achieving our goals and growing together. Our development process is based on rapid iterations, allowing us to quickly implement and test ideas to enhance our product and meet user needs.

What we offer:

  • Innovation Opportunities: Work on cutting-edge technology and help shape the technical direction of our product.
  • Impact: Your contributions will directly influence the user experience and the success of our platform.
  • Startup Atmosphere: Flat hierarchies, direct communication, and a real opportunity to create something very big.
  • Fair Compensation: Performance-based payment with the opportunity to participate in the growth through success.
  • Flexible Work Conditions with Structure: We offer you high flexibility in shaping your workday-provided tasks and goals are met, you're free to design your workflow. At the same time, we value efficient collaboration during core working hours to move projects forward and facilitate quick discussions.

Requirements

Do you have a Master's degree?

Apply for this position