Raymond Camden
Generate AI in the Browser with Chrome AI - Raymond Camden
#1about 1 minute
Introduction to generative AI in the browser
The speaker sets expectations for the talk, assuming basic familiarity with generative AI and JavaScript, and provides links to the presentation materials.
#2about 5 minutes
Understanding the fundamentals of Chrome AI
Chrome AI integrates the Gemini Nano model directly into the browser for task-focused operations, requiring progressive enhancement and a one-time model download.
#3about 3 minutes
Exploring the features of the Chrome AI APIs
The APIs are available for both browsers and extensions, supporting features like streaming, sessions for conversational context, and multimodal input with images and audio.
#4about 3 minutes
Using the translator and language detector APIs
The translator API converts text between languages, while the language detector API identifies the language of a given text and provides a confidence score.
#5about 2 minutes
How to use the summarizer API for text
The summarizer API can generate different styles of summaries, such as key points or headlines, but may sometimes include external context not present in the original text.
#6about 3 minutes
Generating and correcting text with built-in APIs
The writer and rewriter APIs generate or transform text based on tone and length, while the new proofreader API identifies spelling and grammar errors.
#7about 1 minute
Leveraging the flexible general purpose prompt API
The prompt API offers a flexible, session-based interface for general-purpose tasks, supporting system instructions, structured output, and multimodal inputs like images and audio.
#8about 7 minutes
A three-step guide to implementing Chrome AI
Implementing any Chrome AI feature involves checking if the API exists, verifying its availability, and then creating an instance while handling the model download progress.
#9about 4 minutes
Live demos of the translator and summarizer APIs
A demonstration shows the translator API converting English to Mandarin and the summarizer API condensing the Gettysburg Address, highlighting its speed and options.
#10about 4 minutes
Demonstrating the rewriter and prompt APIs
The rewriter API is used to make text more casual and shorter, while the prompt API analyzes the content of images to generate detailed descriptions.
#11about 2 minutes
Enhancing image analysis with geolocation data
Combining the prompt API with EXIF geolocation data from an image allows the model to generate significantly more context-aware and accurate descriptions.
#12about 2 minutes
Final resources and where to learn more
The presentation concludes with links to official documentation, online playgrounds for testing, and information on joining the early preview program for updates.
Related jobs
Jobs that call for the skills explored in this talk.
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Matching moments
06:44 MIN
Using Chrome's built-in AI for on-device features
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
06:33 MIN
The security challenges of building AI browser agents
AI in the Open and in Browsers - Tarek Ziadé
02:49 MIN
Using AI to overcome challenges in systems programming
AI in the Open and in Browsers - Tarek Ziadé
08:40 MIN
Integrating AI into Firefox while respecting user privacy
AI in the Open and in Browsers - Tarek Ziadé
05:26 MIN
Using AI prompts to rebuild a classic 8-bit game
WeAreDevelopers LIVE – Frontend Inspirations, Web Standards and more
04:05 MIN
How AI code generators have become more reliable
AI in the Open and in Browsers - Tarek Ziadé
14:06 MIN
Exploring the role and ethics of AI in gaming
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
03:31 MIN
Using AI to make work more human, not replace humans
Turning People Strategy into a Transformation Engine
Featured Partners
Related Videos
WeAreDevelopers LIVE – AI vs the Web & AI in Browsers
Chris Heilmann, Daniel Cranney & Raymond Camden
Exploring Google Gemini and Generative AI
Exploring the Future of Web AI with Google
Thomas Steiner
Prompt API & WebNN: The AI Revolution Right in Your Browser
Christian Liebel
Generative AI power on the web: making web apps smarter with WebGPU and WebNN
Christian Liebel
AI is an Electric Bike for the Brain - Stoyan Stefanov
From ML to LLM: On-device AI in the Browser
Nico Martin
Google Gemini: Open Source and Deep Thinking Models - Sam Witteveen
Sam Witteveen
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Google
Charing Cross, United Kingdom
Senior
Google Cloud Platform


Amazon.com, Inc
Shoreham-by-Sea, United Kingdom
XML
HTML
JSON
Python
Data analysis
+1

Amazon.com Inc.
XML
HTML
JSON
Python
Data analysis
+1

Amazon.com Inc.
XML
HTML
JSON
Python
Data analysis
+1

Advanced Micro Devices
Amsterdam, Netherlands
C++
OpenCL
Docker
PyTorch
Kubernetes
+1

The Rolewe
Charing Cross, United Kingdom
API
Python
Machine Learning

Neo4j, Inc.
Charing Cross, United Kingdom
£47K
Senior
Neo4j
React
Machine Learning

Descripción De La Vacante
€40-70K
Azure
Python
PyTorch
TensorFlow
+1