Senior Machine Learning Engineer - LLM & Reinforcement Learning

Warmwind

Jena, Germany

4 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

€ 240K

Job location

Jena, Germany

Tech stack

Artificial Intelligence

Artificial Neural Networks

Nvidia CUDA

InfiniBand

Machine Learning

Parallel Computing

TensorFlow

Reinforcement Learning

Graphics Processing Unit (GPU)

PyTorch

Large Language Models

Workday

Job description

As a Senior Machine Learning Engineer at warmwind, you will push the limits of AI by designing, training, and scaling state-of-the-art Large Language Models (LLMs) and advanced reinforcement learning (RL) systems. We are looking for an exceptional expert who has already built and deployed large-scale LLMs and understands every detail of the process-from tokenization to training at scale.

Your work will drive our next-gen AI models, shaping the future of machine intelligence beyond traditional paradigms. You will work with massive compute clusters (500+ H100 GPUs) and cutting-edge reinforcement learning techniques to create highly efficient, scalable, and groundbreaking AI systems.

Responsibilities

Design, train, and optimize Large Language Models (LLMs) from scratch
Scale distributed training on massive GPU clusters (500+ H100 GPUs)
Implement advanced reinforcement learning techniques (RLHF, adversarial self-play, real-time control)
Develop high-performance architectures for multi-modal AI systems
Build simulation environments for RL-based AI agents
Optimize inference speed and efficiency for real-world deployment
Collaborate with top AI researchers to push the boundaries of machine learning innovation

Your Profile Must-haves

Deep expertise in LLMs - you've built and trained large-scale models yourself
Experience with large-scale distributed training on 500+ GPU superclusters
Deep understanding of reinforcement learning, neural network optimization, and self-play methods
Expert in PyTorch, TensorFlow, JAX & low-level optimization techniques (CUDA, Triton, DeepSpeed, etc.)
Familiarity with high-performance computing (HPC, NVLink, InfiniBand, parallel computing)
Strong publication track record in AI/ML research is a plus
Relocation to Jena, Germany after initial onboarding

Company Culture and Work Style

We operate in a dynamic startup environment where speed, efficiency, and innovation are key to achieving our goals and growing together. Our development process is based on rapid iterations, allowing us to quickly implement and test ideas to enhance our product and meet user needs.

What we offer:

Innovation Opportunities: Work on cutting-edge technology and help shape the technical direction of our product.
Impact: Your contributions will directly influence the user experience and the success of our platform.
Startup Atmosphere: Flat hierarchies, direct communication, and a real opportunity to create something very big.
Fair Compensation: Performance-based payment with the opportunity to participate in the growth through success.
Flexible Work Conditions with Structure: We offer you high flexibility in shaping your workday-provided tasks and goals are met, you're free to design your workflow. At the same time, we value efficient collaboration during core working hours to move projects forward and facilitate quick discussions.