Ankit Patel
How AI Models Get Smarter
#1about 2 minutes
How AI models are surpassing human experts
AI models are now exceeding human expert performance on comprehensive benchmarks like MMLU, which measures intelligence across various subjects.
#2about 5 minutes
The shift from labeled to unlabeled data training
The transformer architecture enabled a major shift from training on limited, human-labeled data to pre-training on vast amounts of unlabeled internet text using next-token prediction.
#3about 8 minutes
Refining models with post-training techniques
Pre-trained models are made useful for specific tasks like chatbots through post-training methods such as supervised fine-tuning and reinforcement learning from human feedback (RLHF).
#4about 3 minutes
Improving answer quality with reasoning models
Reasoning models improve accuracy by using test-time scaling, a process where the model prompts itself to double-check facts and logic before providing a final answer.
#5about 5 minutes
A practical workflow for AI application developers
Developers can build AI applications by starting with an API, using structured prompt engineering, and evaluating models in context rather than relying solely on benchmarks.
#6about 3 minutes
Implementing guardrails to secure your application
Protect your AI application from manipulation and misuse by implementing guardrails, detailed system prompts, and specialized guard models to enforce desired behaviors.
#7about 3 minutes
Building modular agentic applications with tools
Agentic applications use a modular architecture where each agent can use specific tools, often defined with natural language prompts, to perform complex tasks.
#8about 4 minutes
Q&A on model behavior and synthetic data
This Q&A covers why LLM responses are non-deterministic, how synthetic data is used for model distillation, and strategies for preventing hallucinations.
Related jobs
Jobs that call for the skills explored in this talk.
Wilken GmbH
Ulm, Germany
Senior
Amazon Web Services (AWS)
Kubernetes
+1
CARIAD
Berlin, Germany
Junior
Intermediate
Python
C++
+1
Matching moments
06:46 MIN
Managing the rapid pace of AI development and its impact
From Monolith Tinkering to Modern Software Development
02:58 MIN
Shifting from traditional code to AI-powered logic
WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA
04:34 MIN
Analyzing the risks and architecture of current AI models
Opening Keynote by Sir Tim Berners-Lee
06:44 MIN
The developer's journey for building AI applications
Supercharge your cloud-native applications with Generative AI
03:27 MIN
Why AI optimizations increase the demand for compute
Coffee with Developers - Stephen Jones - NVIDIA
01:32 MIN
Developing essential AI and human skills for modern teams
Breaking Silos: Successful Collaboration Between Tech & Business Teams in Complex Enterprise Systems
01:14 MIN
The future of on-device AI in web development
Generative AI power on the web: making web apps smarter with WebGPU and WebNN
05:28 MIN
The limitations and potential of AI models
Coffee with Developers - Cassidy Williams -
Featured Partners
Related Videos
Open Source AI, To Foundation Models and Beyond
Ankit Patel, Matt White, Philipp Schmid, Lucie-Aimée Kaffee & Andreas Blattmann
A walkthrough on Responsible AI Frameworks and Case Studies
Toju Duke
AI: Superhero or Supervillain? How and Why with Scott Hanselman
Scott Hanselman
Bringing the power of AI to your application.
Krzysztof Cieślak
WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA
Ankit Patel
Staying Safe in the AI Future
Cassie Kozyrkov
Unlocking the Power of AI: Accessible Language Model Tuning for All
Cedric Clyburn & Legare Kerrison
Bringing AI Everywhere
Stephan Gillich
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.





FDTech GmbH
Boldecker Land, Germany
R
GIT
Python
A/B testing
Machine Learning
+1


Imec
Azure
Python
PyTorch
TensorFlow
Computer Vision
+1

Agenda GmbH
Raubling, Germany
Remote
Intermediate
API
Azure
Python
Docker
+10
