Building a Browser-Based Karaoke Game with Web Speech API
#1about 2 minutes
Creating a gamified karaoke experience in the browser
The motivation for building a custom karaoke game was to expand the song selection for a favorite band and add a scoring mechanic.
#2about 1 minute
An overview of the Web Speech API
The Web Speech API is a native browser feature split into speech recognition (speech-to-text) and speech synthesis (text-to-speech) for voice input and control.
#3about 3 minutes
Understanding the limitations of the Web Speech API
The API's real-world usability is hampered by inconsistent browser support, reliance on vendor-specific cloud services, and privacy concerns.
#4about 5 minutes
Demonstrating basic text-to-speech and voice navigation
A live demonstration shows how the API can be used for simple tasks like reading selected text aloud or enabling basic voice commands on a webpage.
#5about 4 minutes
Implementing the core logic for the karaoke game
The game's implementation involves initializing speech recognition, handling its tendency to stop by restarting it, and comparing the transcribed text against the song's lyrics.
#6about 6 minutes
A live demonstration of the browser karaoke game
The completed karaoke game is tested by speaking, singing, and intentionally using incorrect words to show how it scores the performance.
#7about 3 minutes
The API's current state and related community projects
While the native API is not yet production-ready for commercial use, the community has built useful tools and projects like Web Captioner and Mozilla's Common Voice.
#8about 1 minute
Practical design considerations for voice interfaces
When designing for voice, it's crucial to use concise content, consider conversational flow, and plan for error recovery and cognitive load.
#9about 4 minutes
Finding joy and learning through unproductive side projects
Building "useless" or "silly" side projects is a powerful way to combat burnout, learn new skills, and rediscover the fun in software development.
Related jobs
Jobs that call for the skills explored in this talk.
Featured Partners
Related Videos
Building a Browser-Based Karaoke Game with Web Speech API
Ana Rodrigues
Creating games to make the web fun again
Nolan Royalty
Beats in the Browser: Coding Music with JavaScript
Rowdy Rabouw
Is the web ready for voice user interfaces?
Tobias Münch
Catching up on the basics you don't really need that much code
Chris Heilmann
New Browser APIs, End of React Create App, Smuggling Data in Emoji and more with Rowdy Rabouw
Chris Heilmann & Daniel Cranney & Rowdy Rabouw
From ML to LLM: On-device AI in the Browser
Nico Martin
Beats in the Browser: Coding Music with JavaScript
From learning to earning
Jobs that call for the skills explored in this talk.
Software Engineer, iOS Core Product - San Sebastián, Spain
Speechify
Morcín, Spain
GIT
Continuous Integration
Software Engineer, Platform - Edinburgh, United Kingdom
Speechify
Edinburgh, United Kingdom
€50K
Go
Java
React
Python
+2
Software Engineer, Platform - Bristol, United Kingdom
Speechify
Bristol, United Kingdom
€50K
Go
Java
React
Python
+2
Software Engineer, Platform - Liverpool, United Kingdom
Speechify
Liverpool, United Kingdom
€49K
Go
Java
React
Python
+2


