Francis Powlesland & Elena Kotljarova

On the straight and narrow path - How to get cars to drive themselves using reinforcement learning and trajectory optimization

From erratic exploration to a perfectly optimized path in 150 laps. See how a race car learns to drive itself using reinforcement learning.

#1 (about 5 minutes)

A novel approach to self-driving cars

This project uses reinforcement learning so that a car learns while it drives, unlike pre-trained models that rely on static data.

#2 (about 3 minutes)

Setting a baseline with a human driver

A human driver completes three laps on the physical racetrack to establish a benchmark time for the AI to compete against.

#3 (about 5 minutes)

Observing the AI learn across 1500 laps

The AI's driving behavior evolves from random and unstable after 15 laps to smooth and optimized after 150, showing diminishing returns by 1500 laps.

#4 (about 6 minutes)

Understanding the core concepts of reinforcement learning

A recap of the demo's results leads into an explanation of reinforcement learning's core ideas like agents, environments, actions, and maximizing rewards.
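
As a rough illustration of the agent, environment, action, and reward vocabulary, the sketch below runs one episode on a made-up one-dimensional track. Everything in it (`ToyTrack`, `run_episode`, the two policies, the reward of -1 per step) is a hypothetical stand-in chosen for brevity and is not taken from the talk's demo.

```python
import random


class ToyTrack:
    """Hypothetical environment: a straight track that ends at position `length`."""

    def __init__(self, length=5):
        self.length = length
        self.position = 0

    def reset(self):
        """Put the car back at the start and return the initial state."""
        self.position = 0
        return self.position

    def step(self, action):
        """Apply an action (0 = hold, 1 = move forward one cell)."""
        self.position += action
        done = self.position >= self.length
        # Every step costs 1, so finishing sooner gives a higher total reward.
        reward = -1
        return self.position, reward, done


def run_episode(env, policy, max_steps=100):
    """Let the agent (the policy) interact with the environment once."""
    state = env.reset()
    total_reward = 0
    for _ in range(max_steps):
        action = policy(state)                  # agent picks an action
        state, reward, done = env.step(action)  # environment responds
        total_reward += reward
        if done:
            break
    return total_reward


def always_forward(state):
    """A policy that always moves forward."""
    return 1


def random_policy(state):
    """A policy that ignores the state and acts at random."""
    return random.choice([0, 1])


forward_return = run_episode(ToyTrack(), always_forward)
random_return = run_episode(ToyTrack(), random_policy)
```

The agent's goal of maximizing reward shows up here as the difference between the two totals: the random policy wastes steps and therefore never scores better than the policy that always moves forward.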

#5 (about 5 minutes)

Applying Q-learning with states, actions, and Q-tables

Q-learning uses a table of states and actions to store learned values, making it easy to inspect and update the agent's knowledge.
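
A Q-table can be pictured as a lookup from each state to one learned value per action. The sketch below is a minimal version of that idea; the state name `segment_3`, the three steering actions, and the helper `best_action` are assumptions made for illustration, not the demo's actual state or action encoding.

```python
from collections import defaultdict

# Hypothetical action set for a car on a track.
ACTIONS = ["left", "straight", "right"]

# Q-table: each state maps to one value per action, all starting at 0.0.
# Unseen states are created on first access.
q_table = defaultdict(lambda: {action: 0.0 for action in ACTIONS})


def best_action(state):
    """Return the action with the highest learned value for this state."""
    values = q_table[state]
    return max(values, key=values.get)


# Inspecting and updating the agent's knowledge is plain dictionary access.
q_table["segment_3"]["left"] = 0.8
q_table["segment_3"]["straight"] = 0.2
chosen = best_action("segment_3")
```

Because the table is an ordinary data structure, printing `q_table["segment_3"]` shows exactly what the agent currently believes about that part of the track.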

#6 (about 4 minutes)

Key parameters for tuning the Q-learning algorithm

The algorithm's behavior is controlled by key parameters such as the learning rate (alpha), the discount factor, and the exploration factor (epsilon).
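
To show where each parameter enters, here is a sketch of the textbook tabular Q-learning update together with epsilon-greedy action selection. The numeric values of `ALPHA`, `GAMMA` (the usual symbol for the discount factor), and `EPSILON`, as well as the function names and the tiny two-state table, are illustrative assumptions and not the settings used in the demo.

```python
import random

ALPHA = 0.1    # learning rate: how far each update moves the stored value
GAMMA = 0.9    # discount factor: how much future reward counts
EPSILON = 0.2  # exploration factor: probability of trying a random action


def choose_action(q_row, epsilon=EPSILON, rng=random):
    """Epsilon-greedy: explore with probability epsilon, otherwise exploit."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    return max(range(len(q_row)), key=lambda a: q_row[a])


def q_update(q, state, action, reward, next_state, alpha=ALPHA, gamma=GAMMA):
    """Move Q(state, action) toward reward + gamma * max Q(next_state, .)."""
    target = reward + gamma * max(q[next_state])
    q[state][action] += alpha * (target - q[state][action])
    return q[state][action]


# Tiny table: two states (rows), two actions (columns).
q = [[0.0, 0.0], [0.0, 1.0]]
new_value = q_update(q, state=0, action=1, reward=1.0, next_state=1)
greedy_choice = choose_action([0.1, 0.7, 0.3], epsilon=0.0)
```

With these numbers the target is 1.0 + 0.9 * 1.0 = 1.9, and the stored value moves one tenth of the way from 0.0 toward it. A larger alpha would take a bigger step per update, and a larger epsilon would make the agent try non-greedy actions more often.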

#7 (about 1 minute)

The technical architecture of the race track demo

The demo integrates PS4 controllers, an Arduino, the Watson IoT platform, a Node.js backend, and a React.js frontend.

#8 (about 3 minutes)

Real-world application with Thyssen Krupp

A collaboration with Thyssen Krupp applies these reinforcement learning concepts to a full-size vehicle to learn and adapt its driving style.

#9 (about 7 minutes)

Q&A on data, constraints, and local optima

The speakers answer audience questions about the importance of data quality, how the car stays on the track, and how the algorithm avoids local optima.
