Ekaterina Sirazitdinova
Trends, Challenges and Best Practices for AI at the Edge
#1about 4 minutes
Defining AI at the edge and its industry applications
AI at the edge involves running computations on devices near the data source, transforming industries like manufacturing, retail, and healthcare.
#2about 1 minute
Understanding the unique constraints of edge devices
Edge devices differ from data centers due to their limited compute power, smaller storage capacity, and restricted power consumption.
#3about 2 minutes
Overcoming the primary challenges of edge AI development
Developers must solve for three main challenges: achieving high model accuracy, ensuring real-time throughput, and managing deployment at scale.
#4about 1 minute
Using synthetic data to improve model accuracy
Synthetic data helps improve model accuracy by providing diverse training examples, covering rare corner cases, and reducing expensive manual labeling.
#5about 3 minutes
Optimizing models with quantization and network pruning
Model performance can be significantly improved by using quantization to reduce numerical precision and network pruning to remove unnecessary neurons.
#6about 4 minutes
Advanced techniques for boosting inference performance
Further performance gains can be achieved through network graph optimizations, kernel auto-tuning, dynamic tensor memory, and multistream concurrent execution.
#7about 1 minute
NVIDIA's platform for the end-to-end AI workflow
NVIDIA provides a comprehensive software platform to support the entire AI productization cycle, from data collection and training to optimization and deployment.
#8about 2 minutes
Using Replicator and pre-trained models for development
NVIDIA Replicator generates synthetic data for training, while the NGC catalog offers a wide range of pre-trained models to accelerate development.
#9about 2 minutes
Training and fine-tuning models with the TAO Toolkit
The NVIDIA TAO Toolkit is a zero-coding framework that simplifies training, fine-tuning, pruning, and quantization of AI models.
#10about 2 minutes
Deploying models with TensorRT and Triton Inference Server
NVIDIA TensorRT optimizes models for high-performance inference, while Triton Inference Server provides a flexible solution for serving models at scale.
#11about 2 minutes
Building video analytics pipelines with DeepStream SDK
The NVIDIA DeepStream SDK, built on GStreamer, enables the creation of efficient, GPU-accelerated video analytics pipelines with zero memory copies.
#12about 2 minutes
Matching edge AI challenges with NVIDIA's solutions
A summary of how NVIDIA's tools like Replicator, TAO Toolkit, TensorRT, and DeepStream address the core challenges of accuracy, performance, and deployment.
Related jobs
Jobs that call for the skills explored in this talk.
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Matching moments
03:28 MIN
Why corporate AI adoption lags behind the hype
What 2025 Taught Us: A Year-End Special with Hung Lee
03:15 MIN
The future of recruiting beyond talent acquisition
What 2025 Taught Us: A Year-End Special with Hung Lee
04:27 MIN
Moving beyond headcount to solve business problems
What 2025 Taught Us: A Year-End Special with Hung Lee
05:18 MIN
Incentivizing automation with a 'keep what you kill' policy
What 2025 Taught Us: A Year-End Special with Hung Lee
03:48 MIN
Automating formal processes risks losing informal human value
What 2025 Taught Us: A Year-End Special with Hung Lee
02:44 MIN
Rapid-fire thoughts on the future of work
What 2025 Taught Us: A Year-End Special with Hung Lee
03:39 MIN
Breaking down silos between HR, tech, and business
What 2025 Taught Us: A Year-End Special with Hung Lee
03:38 MIN
Balancing the trade-off between efficiency and resilience
What 2025 Taught Us: A Year-End Special with Hung Lee
Featured Partners
Related Videos
How AI Models Get Smarter
Ankit Patel
Bringing AI Everywhere
Stephan Gillich
The Future of Computing: AI Technologies in the Exascale Era
Stephan Gillich, Tomislav Tipurić, Christian Wiebus & Alan Southall
WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA
Ankit Patel
Multimodal Generative AI Demystified
Ekaterina Sirazitdinova
How computers learn to see – Applying AI to industry
Antonia Hahn
Performant Architecture for a Fast Gen AI User Experience
Nathaniel Okenwa
AI Factories at Scale
Thomas Schmidt
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Forschungszentrum Jülich GmbH
Jülich, Germany
Intermediate
Senior
Linux
Docker
AI Frameworks
Machine Learning

autonomous-teaming
München, Germany
Remote
C++
GIT
Linux
Python
+1

autonomous-teaming
Canton of Toulouse-5, France
Remote
C++
GIT
Linux
Python
+1

NVIDIA Corporation
Remote
Senior
C++
DevOps
Python
Docker
+1

NVIDIA
Municipality of Madrid, Spain
Senior
C++
DevOps
Python
Docker
Kubernetes

European Tech Recruit
Municipality of Vitoria-Gasteiz, Spain
Intermediate
C++
GIT
Python
Machine Learning

European Tech Recruit
Municipality of Madrid, Spain
Intermediate
C++
GIT
Python
Machine Learning

European Tech Recruit
Municipality of Bilbao, Spain
Intermediate
C++
GIT
Python
Machine Learning

Edgeless Systems GmbH
Berlin, Germany
€80-120K
Senior
API
React
Python
TypeScript
+3