Tobias Münch
Is the web ready for voice user interfaces?
#1about 3 minutes
Why voice user interfaces are important for accessibility
Voice interfaces can significantly improve web accessibility for users with disabilities and provide hands-free convenience for mobile professionals.
#2about 1 minute
Understanding the Web Speech API's core functions
The Web Speech API is a W3C standard divided into speech recognition for converting voice to text and speech synthesis for converting text to voice.
#3about 2 minutes
Reviewing VUI research and its current limitations
Research projects like the Conversational Web and a wheelchair VUI demonstrate potential but suffer from inconsistent accuracy, online-only functionality, and lack of wake words.
#4about 3 minutes
How to implement the Web Speech API in JavaScript
Learn the step-by-step process of implementing speech recognition, including loading the class, configuring grammar with JSGF, starting the listener, and processing the results.
#5about 2 minutes
Navigating the Web Speech API's result data structure
The API returns a nested data structure containing a list of results, each with alternatives that include the text transcript and a confidence score.
#6about 3 minutes
Key challenges limiting Web Speech API adoption
The API's adoption is hindered by significant issues including poor developer experience, privacy risks from cloud processing, no offline support, and inconsistent browser implementations.
#7about 3 minutes
A look inside the browser's implementation of speech recognition
An analysis of the Chromium source code reveals how the Web Speech API is implemented through layers that manage and dispatch recognition tasks to either remote cloud services or local OS-dependent engines.
#8about 5 minutes
The future of VUIs with Stanford's React Genie
Stanford's React Genie project offers a new paradigm by loosely coupling a voice agent with React state, allowing for complex voice commands that can manipulate off-screen content and application logic.
#9about 1 minute
Final verdict on the web's readiness for voice UIs
While the current Web Speech API is suitable for experimentation, it is not reliable enough for production use, but promising research indicates a more capable future for web-based voice interfaces.
Related jobs
Jobs that call for the skills explored in this talk.
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Hubert Burda Media
München, Germany
€80-95K
Intermediate
Senior
JavaScript
Node.js
+1
Matching moments
04:57 MIN
Increasing the value of talk recordings post-event
Cat Herding with Lions and Tigers - Christian Heilmann
03:28 MIN
Why corporate AI adoption lags behind the hype
What 2025 Taught Us: A Year-End Special with Hung Lee
03:15 MIN
The future of recruiting beyond talent acquisition
What 2025 Taught Us: A Year-End Special with Hung Lee
02:44 MIN
Rapid-fire thoughts on the future of work
What 2025 Taught Us: A Year-End Special with Hung Lee
03:48 MIN
Automating formal processes risks losing informal human value
What 2025 Taught Us: A Year-End Special with Hung Lee
04:22 MIN
Why HR struggles with technology implementation and adoption
What 2025 Taught Us: A Year-End Special with Hung Lee
04:57 MIN
Shifting from formal corporate speak to an authentic voice
Leveraging Leaders’ Voices: The Business Power of Personal Branding
06:44 MIN
Using Chrome's built-in AI for on-device features
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
Featured Partners
Related Videos
Speak, Code, Deploy: Transforming Developer Experience with Voice Commands
Sami Ekblad
What’s New and What’s Next in Web UI
Cleyra Uzcategui
Hello JARVIS - Building Voice Interfaces for Your LLMS
Nathaniel Okenwa
Building a Browser-Based Karaoke Game with Web Speech API
Ana Rodrigues
From ML to LLM: On-device AI in the Browser
Nico Martin
Exploring the Future of Web AI with Google
Thomas Steiner
Prompt API & WebNN: The AI Revolution Right in Your Browser
Christian Liebel
The State Of The Web
Jeremy Keith
Related Articles
View all articles


.webp?w=240&auto=compress,format)
From learning to earning
Jobs that call for the skills explored in this talk.




Cerence
Ulm, Germany
Figma
Adobe After Effects


Speechify
Municipality of Madrid, Spain
Python
Kubernetes

Comunidad de Madrid
Municipality of Madrid, Spain
€40-60K
Figma
Python
Agile Methodologies

MUUUH! GmbH
Osnabrück, Germany
Senior
REST
Data analysis
Microsoft Office
Amazon Web Services (AWS)

VisualMakers GmbH
Köln, Germany
€56-80K
GIT
React
Flask
Python
+7