Ankit Patel
How AI Models Get Smarter
#1about 2 minutes
How AI models are surpassing human experts
AI models are now exceeding human expert performance on comprehensive benchmarks like MMLU, which measures intelligence across various subjects.
#2about 5 minutes
The shift from labeled to unlabeled data training
The transformer architecture enabled a major shift from training on limited, human-labeled data to pre-training on vast amounts of unlabeled internet text using next-token prediction.
#3about 8 minutes
Refining models with post-training techniques
Pre-trained models are made useful for specific tasks like chatbots through post-training methods such as supervised fine-tuning and reinforcement learning from human feedback (RLHF).
#4about 3 minutes
Improving answer quality with reasoning models
Reasoning models improve accuracy by using test-time scaling, a process where the model prompts itself to double-check facts and logic before providing a final answer.
#5about 5 minutes
A practical workflow for AI application developers
Developers can build AI applications by starting with an API, using structured prompt engineering, and evaluating models in context rather than relying solely on benchmarks.
#6about 3 minutes
Implementing guardrails to secure your application
Protect your AI application from manipulation and misuse by implementing guardrails, detailed system prompts, and specialized guard models to enforce desired behaviors.
#7about 3 minutes
Building modular agentic applications with tools
Agentic applications use a modular architecture where each agent can use specific tools, often defined with natural language prompts, to perform complex tasks.
#8about 4 minutes
Q&A on model behavior and synthetic data
This Q&A covers why LLM responses are non-deterministic, how synthetic data is used for model distillation, and strategies for preventing hallucinations.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
26:27 MIN
Managing the rapid pace of AI development and its impact
From Monolith Tinkering to Modern Software Development
09:55 MIN
Shifting from traditional code to AI-powered logic
WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA
22:05 MIN
Analyzing the risks and architecture of current AI models
Opening Keynote by Sir Tim Berners-Lee
00:19 MIN
The developer's journey for building AI applications
Supercharge your cloud-native applications with Generative AI
18:07 MIN
Why AI optimizations increase the demand for compute
Coffee with Developers - Stephen Jones - NVIDIA
14:35 MIN
Developing essential AI and human skills for modern teams
Breaking Silos: Successful Collaboration Between Tech & Business Teams in Complex Enterprise Systems
31:13 MIN
The future of on-device AI in web development
Generative AI power on the web: making web apps smarter with WebGPU and WebNN
20:05 MIN
The limitations and potential of AI models
Coffee with Developers - Cassidy Williams -
Featured Partners
Related Videos
AI: Superhero or Supervillain? How and Why with Scott Hanselman
Scott Hanselman
Bringing the power of AI to your application.
Krzysztof Cieślak
You are not an AI developer
Zan Markan
Open Source AI, To Foundation Models and Beyond
Ankit Patel, Matt White, Philipp Schmid, Lucie-Aimée Kaffee & Andreas Blattmann
AI & Ethics
PJ Hagerty
WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA
Ankit Patel
The shadows of reasoning – new design paradigms for a gen AI world
Jonas Andrulis
The State of GenAI & Machine Learning in 2025
Alejandro Saucedo
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

AI Systems and MLOps Engineer for Earth Observation
Forschungszentrum Jülich GmbH
Jülich, Germany
Intermediate
Senior
Linux
Docker
AI Frameworks
Machine Learning

Lead Fullstack Engineer AI
Hubert Burda Media
München, Germany
€80-95K
Intermediate
React
Python
Vue.js
Langchain
+1

AI Language Engineer, Artificial General Intelligence - Data Services
Amazon.com Inc.
Senior
R
API
Unix
Perl
Ruby
+7

AI Language Engineer, Artificial General Intelligence - Data Services
Amazon.com Inc.
Senior
R
API
Python
Matlab
Bootstrap
+4

Senior Machine Learning Engineer - AI Systems & Frameworks
10xEngineers
Remote
Senior
API
C++
Python
PyTorch
+4

Agentic AI Architect - Python, LLMs & NLP
FRG Technology Consulting
Intermediate
Azure
Python
Machine Learning



AI/ML Data Scientist: Build & Deploy Generative Models
UL Solutions
Barcelona, Spain
Python
Machine Learning