Build Voice-Enabled Apps with Speech Recognition

Implement speech-to-text and voice control. These sessions cover popular APIs, processing audio streams, and training custom models for assistants, transcription, and accessibility features.

Matching Videos

Building a Browser-Based Karaoke Game with Web Speech API
28:09

Building a Browser-Based Karaoke Game with Web Speech API

Ana Rodrigues

Is the web ready for voice user interfaces?
23:09

Is the web ready for voice user interfaces?

Tobias Münch

Speak, Code, Deploy: Transforming Developer Experience with Voice Commands
28:37

Speak, Code, Deploy: Transforming Developer Experience with Voice Commands

Sami Ekblad

From ML to LLM: On-device AI in the Browser
27:23

From ML to LLM: On-device AI in the Browser

Nico Martin

Summarising Videos Privately Without Cloud APIs - Harald Nezbeda
41:16

Summarising Videos Privately Without Cloud APIs - Harald Nezbeda

Harald Nezbeda

Unleash your web skills on native!
52:01

Unleash your web skills on native!

Rowdy Rabouw

WeAreDevelopers LIVE – Real-Time Phone Agents, Unsafe VPNs & More
1:16:15

WeAreDevelopers LIVE – Real-Time Phone Agents, Unsafe VPNs & More

Chris Heilmann, Daniel Cranney & Marius Obert

Minimal infrastructure for Real‑Time Phone Agents: transcripts in, responses out
29:35

Minimal infrastructure for Real‑Time Phone Agents: transcripts in, responses out

Chris Heilmann, Daniel Cranney, Marius Obert & Staff Developer Evangelist at Twilio

.NET Apps Everywhere!
26:33

.NET Apps Everywhere!

Steve Bilogan

Building a Browser-Based Karaoke Game with Web Speech API
28:09

Building a Browser-Based Karaoke Game with Web Speech API

Rust and Docker: Let's build an AI-powered app!
25:56

Rust and Docker: Let's build an AI-powered app!

Francesco Ciulla

Raise your voice!
30:59

Raise your voice!

Lee Boonstra

Let your iOS app read texts
36:34

Let your iOS app read texts

Milan Todorovic

Hello JARVIS - Building Voice Interfaces for Your LLMS
28:13

Hello JARVIS - Building Voice Interfaces for Your LLMS

Nathaniel Okenwa

Building Blocks of RAG: From Understanding to Implementation
26:25

Building Blocks of RAG: From Understanding to Implementation

Ashish Sharma

Three years of putting LLMs into Software - Lessons learned
26:30

Three years of putting LLMs into Software - Lessons learned

Simon A.T. Jiménez

RPA in the Public Sector
28:36

RPA in the Public Sector

Clemens Schwaiger

Vikings language, the speech of the king Vasa or today's Swedish? Text classification with ML.NET.
27:51

Vikings language, the speech of the king Vasa or today's Swedish? Text classification with ML.NET.

Daniel Gaszewski

Build Your First AI Assistant in 30 Minutes: No Code Workshop
18:55

Build Your First AI Assistant in 30 Minutes: No Code Workshop

Leandro Gomes da Silva

Build RAG from Scratch
28:04

Build RAG from Scratch

Phil Nash

Robots 2.0: When artificial intelligence meets steel
28:19

Robots 2.0: When artificial intelligence meets steel

Thomas Tomow

Semantic AI: Why Embeddings Might Matter More Than LLMs
26:05

Semantic AI: Why Embeddings Might Matter More Than LLMs

Christian Weyer

Hybrid AI: Next Generation Natural Language Processing
21:01

Hybrid AI: Next Generation Natural Language Processing

Jan Schweiger

Inside the Mind of an LLM
27:11

Inside the Mind of an LLM

Emanuele Fabbiani

AI: Superhero or Supervillain? How and Why with Scott Hanselman
25:17

AI: Superhero or Supervillain? How and Why with Scott Hanselman

Scott Hanselman

Fighting Fraud with an AI Grandma - Ben Hopkins and Morten Legarth from faith @ VCCP
38:09

Fighting Fraud with an AI Grandma - Ben Hopkins and Morten Legarth from faith @ VCCP

Ben Hopkins & Morten Legarth

Tolgee: Open-Source, In-Context AI Localization That Cuts Dev Effort
03:36

Tolgee: Open-Source, In-Context AI Localization That Cuts Dev Effort

Jan Cizmar

Adding knowledge to open-source LLMs
25:35

Adding knowledge to open-source LLMs

Sergio Perez & Harshita Seth

Unboxing the DeepFace
45:19

Unboxing the DeepFace

Sefik Serengil

Introducing Digital Samba Embedded Video Conferencing API MCP Server
04:30

Introducing Digital Samba Embedded Video Conferencing API MCP Server

Robert Strobl

Fake or News: Translating Dog Barks, Notepad Gets an Upgrade and Michelin-Star Robots - Paul Tregoing
03:14

Fake or News: Translating Dog Barks, Notepad Gets an Upgrade and Michelin-Star Robots - Paul Tregoing

Chris Heilmann, Daniel Cranney & Paul Tregoing

Using LLMs in your Product
31:12

Using LLMs in your Product

Daniel Töws

Creating Industry ready solutions with LLM Models
58:00

Creating Industry ready solutions with LLM Models

Vijay Krishan Gupta & Gauravdeep Singh Lotey

How to Survive with Dyslexia as a Developer
08:30

How to Survive with Dyslexia as a Developer

Niklas Wünnemann

Detect Hand Pose with Vision
28:51

Detect Hand Pose with Vision

Milan Todorovic

Serverless deployment of (large) NLP models
39:02

Serverless deployment of (large) NLP models

Marek Suppa