Svetlin Penkov
Debugging Machine Learning Code
#1about 6 minutes
The core challenge of debugging machine learning code
Machine learning models are defined by complex computations on high-dimensional data, making traditional debugging methods ineffective.
#2about 4 minutes
Why you should verify code correctness before redesigning models
Poor model performance is often caused by simple code bugs rather than flawed model architecture, a common oversight in the R&D cycle.
#3about 4 minutes
Distinguishing between semantic and runtime bugs in development
The development process involves two distinct feedback loops for handling semantic bugs from model translation and runtime bugs from data issues.
#4about 9 minutes
Limitations of traditional debugging methods for ML
Standard techniques like printing variables, plotting, and custom dashboards fail to provide insight into the complex, high-dimensional state of modern ML models.
#5about 5 minutes
Introducing FMRI for interactive 3D data visualization
The FMRI debugger allows you to inspect high-dimensional tensors visually in 3D, making it easy to understand complex data structures with a single line of code.
#6about 8 minutes
Visualizing a CNN's computational graph with FMRI scan
By wrapping a training loop with the scan function, FMRI automatically generates an interactive 3D computational graph of a PyTorch model.
#7about 3 minutes
Scaling visual debugging and using automated assertions
FMRI handles large-scale models like VGG19 and includes a library of assertions to automatically detect common issues like vanishing gradients or invalid inputs.
#8about 6 minutes
Live demo of debugging a CNN with FMRI assertions
A live demonstration shows how to inspect a 3D tensor and use FMRI's built-in assertions to instantly find the root cause of NaN errors in a CNN.
#9about 3 minutes
Exploring the full computational graph of ResNet-101
This demonstration visualizes the entire ResNet-101 model, showcasing the tool's ability to handle massive computational graphs and explore learned features.
Related jobs
Jobs that call for the skills explored in this talk.
CARIAD
Berlin, Germany
Junior
Intermediate
Python
C++
+1
Wilken GmbH
Ulm, Germany
Senior
Amazon Web Services (AWS)
Kubernetes
+1
Matching moments
14:46 MIN
Key takeaways and Q&A on debugger internals
Debugging Unveiled: Exploring Debugger Internals and Hidden Gems
01:39 MIN
Using LLMs to understand and navigate codebases
How E.On productionizes its AI model & Implementation of Secure Generative AI.
03:01 MIN
Common challenges in developing machine learning applications
Data Fabric in Action - How to enhance a Stock Trading App with ML and Data Virtualization
01:47 MIN
Using AI for root cause analysis and fixes
Debugging in the Dark
05:56 MIN
Developer tools and learning resources for GPUs
Accelerating Python on GPUs
05:14 MIN
Challenge three: Ensuring machine learning models are robust
How Machine Learning is turning the Automotive Industry upside down
03:55 MIN
The future of developer tools in an AI-driven world
Are frameworks like React redundant in an AI world?
03:27 MIN
Understanding the new AI developer stack and MLOps workflow
Developer Experience, Platform Engineering and AI powered Apps
Featured Partners
Related Videos
Overview of Machine Learning in Python
Adrian Schmitt
Getting Started with Machine Learning
Alexandra Waldherr
From ML to LLM: On-device AI in the Browser
Nico Martin
The pitfalls of Deep Learning - When Neural Networks are not the solution
Adrian Spataru & Bohdan Andrusyak
Machine Learning for Software Developers (and Knitters)
Kris Howard
30 Golden Rules of Deep Learning Performance
Anirudh Koul
Machine learning in the browser with TensorFlowjs
Håkan Silfvernagel
Multimodal Generative AI Demystified
Ekaterina Sirazitdinova
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.


Paris-based
Paris, France
Python
Docker
TensorFlow
Kubernetes
Computer Vision
+2



Descriptiona Venture-backed Deep-tech
Charing Cross, United Kingdom
ETL
Python
Docker
PyTorch
Machine Learning

Veriff
Barcelona, Spain
Remote
€70-90K
Computer Vision
Machine Learning

Leuven MindGate
Leuven, Belgium
Python
PyTorch
TensorFlow
Machine Learning


Inria
Canton de Villeneuve-d’Ascq, France
Remote
GIT
Python
Unit Testing
Data analysis
+1