Raymond Camden
Generate AI in the Browser with Chrome AI - Raymond Camden
#1about 1 minute
Introduction to generative AI in the browser
The speaker sets expectations for the talk, assuming basic familiarity with generative AI and JavaScript, and provides links to the presentation materials.
#2about 5 minutes
Understanding the fundamentals of Chrome AI
Chrome AI integrates the Gemini Nano model directly into the browser for task-focused operations, requiring progressive enhancement and a one-time model download.
#3about 3 minutes
Exploring the features of the Chrome AI APIs
The APIs are available for both browsers and extensions, supporting features like streaming, sessions for conversational context, and multimodal input with images and audio.
#4about 3 minutes
Using the translator and language detector APIs
The translator API converts text between languages, while the language detector API identifies the language of a given text and provides a confidence score.
#5about 2 minutes
How to use the summarizer API for text
The summarizer API can generate different styles of summaries, such as key points or headlines, but may sometimes include external context not present in the original text.
#6about 3 minutes
Generating and correcting text with built-in APIs
The writer and rewriter APIs generate or transform text based on tone and length, while the new proofreader API identifies spelling and grammar errors.
#7about 1 minute
Leveraging the flexible general purpose prompt API
The prompt API offers a flexible, session-based interface for general-purpose tasks, supporting system instructions, structured output, and multimodal inputs like images and audio.
#8about 7 minutes
A three-step guide to implementing Chrome AI
Implementing any Chrome AI feature involves checking if the API exists, verifying its availability, and then creating an instance while handling the model download progress.
#9about 4 minutes
Live demos of the translator and summarizer APIs
A demonstration shows the translator API converting English to Mandarin and the summarizer API condensing the Gettysburg Address, highlighting its speed and options.
#10about 4 minutes
Demonstrating the rewriter and prompt APIs
The rewriter API is used to make text more casual and shorter, while the prompt API analyzes the content of images to generate detailed descriptions.
#11about 2 minutes
Enhancing image analysis with geolocation data
Combining the prompt API with EXIF geolocation data from an image allows the model to generate significantly more context-aware and accurate descriptions.
#12about 2 minutes
Final resources and where to learn more
The presentation concludes with links to official documentation, online playgrounds for testing, and information on joining the early preview program for updates.
Related jobs
Jobs that call for the skills explored in this talk.
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Eltemate
Amsterdam, Netherlands
Intermediate
Senior
TypeScript
Continuous Integration
+1
Matching moments
06:44 MIN
Using Chrome's built-in AI for on-device features
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
06:33 MIN
The security challenges of building AI browser agents
AI in the Open and in Browsers - Tarek Ziadé
02:49 MIN
Using AI to overcome challenges in systems programming
AI in the Open and in Browsers - Tarek Ziadé
04:56 MIN
Recreating React components using AI and dev tools
WeAreDevelopers LIVE – AI, Freelancing, Keeping Up with Tech and More
09:10 MIN
How AI is changing the freelance developer experience
WeAreDevelopers LIVE – AI, Freelancing, Keeping Up with Tech and More
08:40 MIN
Integrating AI into Firefox while respecting user privacy
AI in the Open and in Browsers - Tarek Ziadé
03:07 MIN
Final advice for developers adapting to AI
WeAreDevelopers LIVE – AI, Freelancing, Keeping Up with Tech and More
03:16 MIN
Improving the developer feedback loop with specialized tools
Developer Time Is Valuable - Use the Right Tools - Kilian Valkhof
Featured Partners
Related Videos
WeAreDevelopers LIVE – AI vs the Web & AI in Browsers
Chris Heilmann, Daniel Cranney & Raymond Camden
Exploring Google Gemini and Generative AI
Exploring the Future of Web AI with Google
Thomas Steiner
Prompt API & WebNN: The AI Revolution Right in Your Browser
Christian Liebel
Generative AI power on the web: making web apps smarter with WebGPU and WebNN
Christian Liebel
AI is an Electric Bike for the Brain - Stoyan Stefanov
From ML to LLM: On-device AI in the Browser
Nico Martin
Google Gemini: Open Source and Deep Thinking Models - Sam Witteveen
Sam Witteveen
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Google
Charing Cross, United Kingdom
Senior
Google Cloud Platform

Amazon.com, Inc
Shoreham-by-Sea, United Kingdom
XML
HTML
JSON
Python
Data analysis
+1


Amazon.com Inc.
XML
HTML
JSON
Python
Data analysis
+1

Amazon.com Inc.
XML
HTML
JSON
Python
Data analysis
+1

Advanced Micro Devices
Amsterdam, Netherlands
C++
OpenCL
Docker
PyTorch
Kubernetes
+1

The Rolewe
Charing Cross, United Kingdom
API
Python
Machine Learning

Microsoft
Cambridge, United Kingdom
PyTorch
Machine Learning

The Writer
London
Contract
Published: 19 hours ago
Competitive
Charing Cross, United Kingdom
Senior
API
REST
Azure
React
DevOps
+9