Privacy-first in-browser Generative AI web apps: offline-ready, future-proof, standards-based
What if you could run powerful AI in a web app with total user privacy, completely offline? New browser APIs make it possible.
#1 about 3 minutes
A demo of client-side AI using the NPU
A computer vision application performs image classification directly in the browser without any backend calls by leveraging the device's Neural Processing Unit (NPU).
#2 about 3 minutes
The case for privacy-first, on-device AI
On-device AI meets user demands for performance, privacy, and offline access while satisfying developer needs for a unified codebase and helpful abstractions.
#3 about 3 minutes
Introducing the Web Neural Network (WebNN) standard
The emerging WebNN standard provides a model-agnostic, unified abstraction for near-native AI execution in the browser, designed around practical use cases.
#4 about 4 minutes
Leveraging hardware like the CPU, GPU, and NPU
WebNN can target whichever of the CPU, GPU, and NPU a device provides, with the NPU offering a power-efficient alternative to the GPU for sustained AI workloads on mobile devices.
#5 about 6 minutes
Getting started with the low-level WebNN API
To experiment with the emerging WebNN standard, developers must use canary browser versions and enable specific flags. The API itself is low-level and can be complex to work with directly.
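Because WebNN sits behind flags, a sensible first step is to check whether the API is exposed at all before asking for a context. The following is a minimal sketch, not taken from the talk. It assumes the `navigator.ml.createContext()` entry point and the `deviceType` option as they appeared in WebNN drafts; the draft is still changing, so the option may be renamed or dropped, and an unavailable device may cause a silent fallback rather than a rejection. The names `MLLike`, `getML`, and `createFirstContext` are hypothetical helpers.

```typescript
// Device types named in WebNN drafts; the exact option set may change (assumption).
type DeviceType = "npu" | "gpu" | "cpu";

// Minimal structural type for the part of `navigator.ml` used here.
// Declared locally so the sketch compiles without WebNN type definitions.
interface MLLike {
  createContext(options?: { deviceType?: DeviceType }): Promise<unknown>;
}

// Returns the `ml` entry point if the given navigator-like object exposes it,
// otherwise undefined (for example when the browser flag is not enabled).
function getML(nav: unknown): MLLike | undefined {
  const ml = (nav as { ml?: MLLike } | undefined)?.ml;
  return ml && typeof ml.createContext === "function" ? ml : undefined;
}

// Tries each device type in order and returns the first context that can be
// created, or undefined if every attempt is rejected.
async function createFirstContext(
  ml: MLLike,
  order: DeviceType[] = ["npu", "gpu", "cpu"],
): Promise<{ device: DeviceType; context: unknown } | undefined> {
  for (const device of order) {
    try {
      const context = await ml.createContext({ deviceType: device });
      return { device, context };
    } catch {
      // This device type was rejected; try the next one.
    }
  }
  return undefined;
}
```

In a page, this could be called as `const ml = getML(navigator)` followed by `await createFirstContext(ml)` when `ml` is defined. If either step returns `undefined`, the app can switch to a non-WebNN code path.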
#6 about 7 minutes
Simplifying development with high-level AI frameworks
Frameworks like ONNX Runtime Web and Transformers.js provide higher-level, task-based abstractions over WebNN, making it easier for app developers to build AI features.
#7 about 3 minutes
Best practices and the future of browser AI
Focus on user experience by providing fallbacks and progress indicators, and look ahead to upcoming built-in browser APIs like the Prompt API that abstract away model management.
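The advice about fallbacks and progress indicators can be sketched as a small loader that tries backends in order of preference and reports progress while each one loads. This is an illustrative sketch only. The `Backend` interface, the `loadWithFallback` function, and the backend names in the usage note are hypothetical and not tied to any particular library.

```typescript
// A backend is anything that can load a model and report its progress
// as a fraction between 0 and 1. All names here are hypothetical.
interface Backend<T> {
  name: string;
  load(onProgress: (fraction: number) => void): Promise<T>;
}

interface LoadResult<T> {
  backend: string;   // name of the backend that succeeded
  model: T;
  skipped: string[]; // backends that failed before one succeeded
}

// Tries each backend in order. Progress is forwarded as a whole percentage
// so the UI can drive a progress bar; failures move on to the next backend.
async function loadWithFallback<T>(
  backends: Backend<T>[],
  onProgress: (backend: string, percent: number) => void,
): Promise<LoadResult<T>> {
  const skipped: string[] = [];
  for (const backend of backends) {
    try {
      const model = await backend.load((fraction) => {
        // Clamp to [0, 1] so an out-of-range value cannot break the progress bar.
        const clamped = Math.min(1, Math.max(0, fraction));
        onProgress(backend.name, Math.round(clamped * 100));
      });
      return { backend: backend.name, model, skipped };
    } catch {
      skipped.push(backend.name);
    }
  }
  throw new Error(`No backend could load the model (tried: ${skipped.join(", ")})`);
}
```

A caller might pass backends named, say, "webnn", "webgpu", and "wasm" in that order, and use the `skipped` list to tell the user that a slower path is in use.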
#8 about 2 minutes
Demo code and using web workers for performance
The demo applications are built as offline-ready Progressive Web Apps and use Web Workers to run intensive AI computations without freezing the main UI thread.
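The Web Worker pattern can be sketched as a worker script that receives a request, runs the heavy computation, and posts the result back, so the main thread only sends and receives messages. This is a minimal sketch and not the demo's actual code. The message shapes and the `classify` function are hypothetical; `classify` is a placeholder brightness check standing in for real model inference.

```typescript
// Message shapes exchanged between the page and the worker (hypothetical).
type ClassifyRequest = { id: number; pixels: number[] };
type ClassifyResponse =
  | { id: number; label: string }
  | { id: number; error: string };

// Placeholder for the real model call: labels an image by mean brightness.
function classify(pixels: number[]): string {
  if (pixels.length === 0) throw new Error("empty image");
  const mean = pixels.reduce((sum, value) => sum + value, 0) / pixels.length;
  return mean > 127 ? "bright" : "dark";
}

// Pure request handler: converts failures into an error response so the
// page always gets an answer for every request id.
function handleRequest(request: ClassifyRequest): ClassifyResponse {
  try {
    return { id: request.id, label: classify(request.pixels) };
  } catch (err) {
    const message = err instanceof Error ? err.message : String(err);
    return { id: request.id, error: message };
  }
}

// Wiring: only attach the handler when this file actually runs as a worker.
const scope = globalThis as unknown as {
  WorkerGlobalScope?: unknown;
  onmessage: ((event: { data: ClassifyRequest }) => void) | null;
  postMessage(message: ClassifyResponse): void;
};
if (typeof scope.WorkerGlobalScope !== "undefined") {
  scope.onmessage = (event) => scope.postMessage(handleRequest(event.data));
}
```

On the page side, the script would be started with `new Worker(...)` and driven through `postMessage` and an `onmessage` listener. Keeping `handleRequest` free of worker globals also makes it easy to test outside the browser.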