Paul Graham
Accelerating Python on GPUs
#1 about 2 minutes
The rise of general-purpose GPU computing
NVIDIA's evolution from a graphics hardware company to a leader in general-purpose computing accelerated once GPUs were used to train AI models such as AlexNet.
#2 about 4 minutes
Why GPUs outperform CPUs for parallel tasks
As single-threaded CPU performance plateaued, GPUs offered a path forward with their massively parallel architecture designed for simultaneous computation.
#3 about 6 minutes
Understanding modern GPU architecture and operation
The CPU offloads compute-intensive code to the GPU, which runs thousands of threads to hide memory latency, relying on streaming multiprocessors and high-bandwidth memory.
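As a rough illustration of the latency-hiding idea in this chapter, the sketch below estimates how many threads a streaming multiprocessor would need in flight to stay busy while memory requests are outstanding. The cycle counts and the function name are assumptions made up for illustration, not figures from the talk or from any specific GPU; only the warp size of 32 threads is a CUDA fact.

```python
import math

THREADS_PER_WARP = 32  # CUDA schedules threads in groups (warps) of 32


def warps_to_hide_latency(memory_latency_cycles: int, work_cycles_per_warp: int) -> int:
    """Estimate the warps needed so the scheduler always has one ready to run.

    Simplified model (an assumption): each warp does `work_cycles_per_warp`
    cycles of arithmetic, then stalls for `memory_latency_cycles` on a load.
    While one warp is stalled, the others must supply enough work to fill the wait.
    """
    other_warps = math.ceil(memory_latency_cycles / work_cycles_per_warp)
    return other_warps + 1  # plus the warp that is currently waiting


# Hypothetical numbers, chosen only to make the arithmetic concrete.
warps = warps_to_hide_latency(memory_latency_cycles=400, work_cycles_per_warp=10)
threads = warps * THREADS_PER_WARP
print(f"{warps} warps, {threads} threads per multiprocessor")
```

Under this simplified model, the longer the memory latency relative to the work each warp does between loads, the more resident threads are needed, which is why GPU programs are usually launched with far more threads than there are cores.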
#4 about 7 minutes
Introducing the CUDA parallel computing platform
The CUDA platform is a complete ecosystem with compilers, libraries, and frameworks that enables developers to program GPUs using various languages and abstraction levels.
#5 about 3 minutes
Leveraging specialized hardware like Tensor Cores
Specialized hardware like Tensor Cores can be used transparently through high-level libraries like cuDNN or programmed directly with low-level APIs for maximum performance.
#6 about 6 minutes
High-level frameworks for domain-specific acceleration
Frameworks like RAPIDS provide GPU-accelerated, drop-in replacements for popular data science libraries such as pandas (cuDF) and NetworkX (cuGraph) with minimal code changes.
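A minimal sketch of what "drop-in" can look like in practice, assuming the subset of the pandas API used here (DataFrame construction, `groupby`, `mean`) behaves the same in cuDF. The helper names `pick_dataframe_backend` and `mean_fare_by_city` and the sample data are hypothetical; the code falls back to pandas when cuDF or a GPU is not available, so the same script runs on either.

```python
import importlib


def pick_dataframe_backend():
    """Return cuDF if it imports and works, else pandas, else None."""
    for name in ("cudf", "pandas"):
        try:
            module = importlib.import_module(name)
            module.DataFrame({"probe": [0]})  # fails early if no usable GPU
            return module
        except Exception:
            continue
    return None


def mean_fare_by_city(xdf):
    """Run the same DataFrame code against whichever backend was chosen."""
    df = xdf.DataFrame(
        {
            "city": ["Amsterdam", "Amsterdam", "Oxford"],
            "fare": [10.0, 20.0, 30.0],
        }
    )
    result = df.groupby("city")["fare"].mean()
    if hasattr(result, "to_pandas"):  # cuDF result lives on the GPU
        result = result.to_pandas()   # copy it back to host memory
    return result.to_dict()


if __name__ == "__main__":
    backend = pick_dataframe_backend()
    if backend is None:
        print("Neither cuDF nor pandas is installed")
    else:
        print(backend.__name__, mean_fare_by_city(backend))
```

The only backend-specific line is the copy back to host memory; the DataFrame logic itself is unchanged, which is the point of a drop-in replacement.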
#7 about 10 minutes
A progressive approach to programming GPUs in Python
Developers can choose from a spectrum of Python libraries, from simple drop-in replacements like cuNumeric and CuPy to JIT compilers like Numba and direct kernel programming with PyCUDA.
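Two rungs of that spectrum can be sketched side by side, assuming CuPy and Numba are installed and a CUDA-capable GPU is present; without them the first part falls back to NumPy and the second part should simply not be called. The function names `pick_array_backend`, `rms`, and `add_one_on_gpu` are hypothetical and not from the talk.

```python
import importlib


def pick_array_backend():
    """Return CuPy if it imports and sees a GPU, else NumPy, else None."""
    try:
        cp = importlib.import_module("cupy")
        if cp.cuda.runtime.getDeviceCount() > 0:
            return cp
    except Exception:
        pass  # CuPy missing or no usable CUDA device
    try:
        return importlib.import_module("numpy")
    except ImportError:
        return None


def rms(xp, n=1_000_000):
    """Drop-in rung: the same array code runs on NumPy or CuPy."""
    x = xp.arange(n, dtype=xp.float64)
    return float(xp.sqrt(xp.mean(x * x)))


def add_one_on_gpu(host_array):
    """JIT rung: a CUDA kernel written in Python and compiled by Numba.

    Call this only when numba.cuda.is_available() is True.
    """
    from numba import cuda  # lazy import so the rest works without Numba

    @cuda.jit
    def add_one_kernel(arr):
        i = cuda.grid(1)   # global index of this thread
        if i < arr.size:   # guard threads past the end of the array
            arr[i] += 1.0

    device_array = cuda.to_device(host_array)
    threads_per_block = 128
    blocks = (host_array.size + threads_per_block - 1) // threads_per_block
    add_one_kernel[blocks, threads_per_block](device_array)
    return device_array.copy_to_host()


if __name__ == "__main__":
    xp = pick_array_backend()
    if xp is None:
        print("Neither CuPy nor NumPy is installed")
    else:
        print(xp.__name__, rms(xp))
```

The drop-in rung changes only which module is imported, while the kernel rung makes the thread indexing and the host-to-device copies explicit. That extra control is the trade-off for writing more code.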
#8 about 6 minutes
Developer tools and learning resources for GPUs
NVIDIA offers a comprehensive suite of developer tools for profiling and debugging, along with learning resources like the NGC repository, DLI courses, and community events.