AI Engineer - Reinforcement Learning

Blue Yonder Group, Inc.

Paris, France

8 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Job location

Paris, France

Tech stack

Artificial Intelligence

Python

Reinforcement Learning

PyTorch

Large Language Models

Data Pipelines

Job description

Design and implement RL environments for supply chain decision-making
Develop reward functions that capture what "good" looks like for our agents
Create evaluation frameworks to measure agent performance and catch failure modes
Build data pipelines for training and human feedback collection
Document what works (and what doesn't) so we can compound our learnings
Stay on top of industry trends and cutting edge use cases

Requirements

You've trained or fine-tuned LLMs
Are excited about AI-assisted tools and getting the most out of them
Build & customize your own AI workflows
Have experience working with AI agents and RL environments in production
Are proficient in Python and PyTorch
Can balance research exploration with shipping working code
Hands on experience with RL techniques (reward shaping, policy optimization, RLHF)
Thrive in fast-moving environments where priorities shift
Care about craft in your work
Are curious about why things work, not just that they work

Bonus points if:

You have experience with human-in-the-loop ML systems
You've built evaluation frameworks for open-ended tasks
You're familiar with supply chain, logistics, or operations domains
You have a side project that shows you can't stop tinkering

Our Values

If you want to know the heart of a company, take a look at their values. Ours unite us. They are what drive our success - and the success of our customers. Does your heart beat like ours? Find out here: Core Values