Christian Liebel
Generative AI power on the web: making web apps smarter with WebGPU and WebNN
#1about 1 minute
Generative AI use cases and cloud provider limitations
Cloud-based AI faces challenges like required internet connectivity, data privacy risks, and high costs, creating a need for local alternatives.
#2about 13 minutes
Running large language models locally with Web LLM
Web LLM enables running multi-gigabyte language models like Llama 3 directly in the browser for offline use, despite initial download and initialization times.
#3about 2 minutes
The technology behind in-browser AI execution
In-browser AI performance is accelerated by combining WebAssembly for efficient computation and the new WebGPU API for direct access to the system's GPU.
#4about 4 minutes
Boosting performance with the upcoming WebNN API
The Web Neural Network (WebNN) API provides access to dedicated Neural Processing Units (NPUs) for even faster, more efficient on-device model inference.
#5about 6 minutes
Solving model duplication with the new Prompt API
The experimental Prompt API addresses the issue of redundant model downloads by allowing websites to access a single, shared OS-level model like Gemini Nano.
#6about 3 minutes
Using the Prompt API for on-device data extraction
A demonstration shows how the Prompt API can use a local model to accurately extract structured data from unstructured text, highlighting its practical application.
#7about 2 minutes
Generating images in the browser with WebSD
WebSD brings text-to-image generation to the browser by running Stable Diffusion models locally using WebGPU, enabling creative AI tasks without cloud dependency.
#8about 1 minute
Weighing the pros and cons of local AI models
Local AI models offer superior privacy, offline availability, and low cost, but come with trade-offs like lower quality, high system requirements, and slower performance.
#9about 1 minute
The future of on-device AI in web development
While cloud-based models are currently superior, the trend towards more compact open-source models and OS-integrated AI suggests a growing role for local AI in specialized web applications.
Related jobs
Jobs that call for the skills explored in this talk.
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Sunhat
Köln, Germany
Remote
€85-115K
Senior
Team Leadership
Software Architecture
+1
Matching moments
04:57 MIN
Increasing the value of talk recordings post-event
Cat Herding with Lions and Tigers - Christian Heilmann
02:54 MIN
Automating video post-production with local scripts
Cat Herding with Lions and Tigers - Christian Heilmann
06:44 MIN
Using Chrome's built-in AI for on-device features
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
03:28 MIN
Why corporate AI adoption lags behind the hype
What 2025 Taught Us: A Year-End Special with Hung Lee
03:15 MIN
The future of recruiting beyond talent acquisition
What 2025 Taught Us: A Year-End Special with Hung Lee
04:27 MIN
Moving beyond headcount to solve business problems
What 2025 Taught Us: A Year-End Special with Hung Lee
02:44 MIN
Rapid-fire thoughts on the future of work
What 2025 Taught Us: A Year-End Special with Hung Lee
03:48 MIN
Automating formal processes risks losing informal human value
What 2025 Taught Us: A Year-End Special with Hung Lee
Featured Partners
Related Videos
Prompt API & WebNN: The AI Revolution Right in Your Browser
Christian Liebel
Privacy-first in-browser Generative AI web apps: offline-ready, future-proof, standards-based
Maxim Salnikov
From ML to LLM: On-device AI in the Browser
Nico Martin
Exploring the Future of Web AI with Google
Thomas Steiner
Generate AI in the Browser with Chrome AI - Raymond Camden
Raymond Camden
AI: Superhero or Supervillain? How and Why with Scott Hanselman
Scott Hanselman
WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA
Ankit Patel
Performant Architecture for a Fast Gen AI User Experience
Nathaniel Okenwa
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Forschungszentrum Jülich GmbH
Jülich, Germany
Intermediate
Senior
Linux
Docker
AI Frameworks
Machine Learning

UL Solutions
Barcelona, Spain
Python
Machine Learning

University of the Arts, London
Sleaford, United Kingdom
£34-41K
Python
PyTorch
TensorFlow

autonomous-teaming
München, Germany
Remote
C++
GIT
Linux
Python
+1


Conrad Electronic SE
Hirschau, Germany
Azure
Python
Google Cloud Platform

Accenture
Charing Cross, United Kingdom
REST
React
GraphQL
React Native
Continuous Integration

RE-INvent Retail GmbH
Azure
Python
Microservices
Google Cloud Platform

Descripción De La Vacante
€40-70K
Azure
Python
PyTorch
TensorFlow
+1