AI Engineer - Reinforcement Learning
Blue Yonder Group, Inc.
Paris, France
8 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
EnglishJob location
Paris, France
Tech stack
Artificial Intelligence
Python
Reinforcement Learning
PyTorch
Large Language Models
Data Pipelines
Job description
- Design and implement RL environments for supply chain decision-making
- Develop reward functions that capture what "good" looks like for our agents
- Create evaluation frameworks to measure agent performance and catch failure modes
- Build data pipelines for training and human feedback collection
- Document what works (and what doesn't) so we can compound our learnings
- Stay on top of industry trends and cutting edge use cases
Requirements
- You've trained or fine-tuned LLMs
- Are excited about AI-assisted tools and getting the most out of them
- Build & customize your own AI workflows
- Have experience working with AI agents and RL environments in production
- Are proficient in Python and PyTorch
- Can balance research exploration with shipping working code
- Hands on experience with RL techniques (reward shaping, policy optimization, RLHF)
- Thrive in fast-moving environments where priorities shift
- Care about craft in your work
- Are curious about why things work, not just that they work
Bonus points if:
- You have experience with human-in-the-loop ML systems
- You've built evaluation frameworks for open-ended tasks
- You're familiar with supply chain, logistics, or operations domains
- You have a side project that shows you can't stop tinkering
Our Values
If you want to know the heart of a company, take a look at their values. Ours unite us. They are what drive our success - and the success of our customers. Does your heart beat like ours? Find out here: Core Values