Generate AI in the Browser with Chrome AI - Raymond Camden

Add powerful, on-device generative AI to your web app—no server required. Learn how with the new Chrome AI APIs.

Generate AI in the Browser with Chrome AI - Raymond Camden
#1about 1 minute

Introduction to generative AI in the browser

The speaker sets expectations for the talk, assuming basic familiarity with generative AI and JavaScript, and provides links to the presentation materials.

#2about 5 minutes

Understanding the fundamentals of Chrome AI

Chrome AI integrates the Gemini Nano model directly into the browser for task-focused operations, requiring progressive enhancement and a one-time model download.

#3about 3 minutes

Exploring the features of the Chrome AI APIs

The APIs are available for both browsers and extensions, supporting features like streaming, sessions for conversational context, and multimodal input with images and audio.

#4about 3 minutes

Using the translator and language detector APIs

The translator API converts text between languages, while the language detector API identifies the language of a given text and provides a confidence score.

#5about 2 minutes

How to use the summarizer API for text

The summarizer API can generate different styles of summaries, such as key points or headlines, but may sometimes include external context not present in the original text.

#6about 3 minutes

Generating and correcting text with built-in APIs

The writer and rewriter APIs generate or transform text based on tone and length, while the new proofreader API identifies spelling and grammar errors.

#7about 1 minute

Leveraging the flexible general purpose prompt API

The prompt API offers a flexible, session-based interface for general-purpose tasks, supporting system instructions, structured output, and multimodal inputs like images and audio.

#8about 7 minutes

A three-step guide to implementing Chrome AI

Implementing any Chrome AI feature involves checking if the API exists, verifying its availability, and then creating an instance while handling the model download progress.

#9about 4 minutes

Live demos of the translator and summarizer APIs

A demonstration shows the translator API converting English to Mandarin and the summarizer API condensing the Gettysburg Address, highlighting its speed and options.

#10about 4 minutes

Demonstrating the rewriter and prompt APIs

The rewriter API is used to make text more casual and shorter, while the prompt API analyzes the content of images to generate detailed descriptions.

#11about 2 minutes

Enhancing image analysis with geolocation data

Combining the prompt API with EXIF geolocation data from an image allows the model to generate significantly more context-aware and accurate descriptions.

#12about 2 minutes

Final resources and where to learn more

The presentation concludes with links to official documentation, online playgrounds for testing, and information on joining the early preview program for updates.

Related jobs
Jobs that call for the skills explored in this talk.

Featured Partners

Related Articles

View all articles
EM
Eli McGarvie
11 Best AI Chrome Extensions for Developers
In this brave new world of software development, time is a precious commodity, and developers are always on the lookout for clever shortcuts. With AI Chrome Extensions at your disposal, you’ll be able to tap into the extraordinary powers of AI to str...
11 Best AI Chrome Extensions for Developers
CH
Chris Heilmann
With AIs wide open - WeAreDevelopers at All Things Open 2025
Last week our VP of Developer Relations, Chris Heilmann, flew to Raleigh, North Carolina to present at All Things Open . An excellent event he had spoken at a few times in the past and this being the “Lucky 13” edition, he didn’t hesitate to come and...
With AIs wide open - WeAreDevelopers at All Things Open 2025
DC
Daniel Cranney
Dev Digest 161: Gemini 2.5, AI killing search, EU A11Y Act
Inside last week’s Dev Digest 161 . 🤖 Most traffic to web sites comes from AI chatbots 🖼️ Google releases Gemini 2.5 and OpenAI adds native image generation ⬛︎ Next.js has a big security issue 👨‍💻 How hackers weaponise code agents 📜 WikiTok analysed...
Dev Digest 161: Gemini 2.5, AI killing search, EU A11Y Act

From learning to earning

Jobs that call for the skills explored in this talk.

Generative AI Developer

Generative AI Developer

University of the Arts, London
Sleaford, United Kingdom

£34-41K
Python
PyTorch
TensorFlow
Generative AI Engineer

Generative AI Engineer

Generative Ai Engineer83zero Limited
Glasgow, United Kingdom

£80-88K
GIT
Azure
NoSQL
React
+16