Chris Heilmann, Daniel Cranney, Marius Obert & Staff Developer Evangelist at Twilio
Minimal infrastructure for Real‑Time Phone Agents: transcripts in, responses out
#1about 4 minutes
Why voice is a powerful and natural AI interface
Voice interaction is significantly faster for input than typing and allows for hands-free operation, making it a natural fit for many AI use cases despite the challenges of audio parsing.
#2about 1 minute
The complexity of building traditional voice agents
Building a voice agent the traditional way requires managing separate services for speech recognition, text-to-speech, and interruption detection, which introduces significant latency and complexity.
#3about 4 minutes
Simplifying voice agent architecture with ConversationRelay
Twilio's ConversationRelay abstracts away the complexities of voice processing, allowing developers to receive text transcripts via a WebSocket and focus solely on their application logic.
#4about 6 minutes
Live coding a Deno server for a phone agent
A basic Deno server is set up to handle initial HTTP requests by returning TwiML instructions and to upgrade the connection to a WebSocket for real-time communication.
#5about 3 minutes
Configuring a Twilio number and testing the connection
A new phone number is purchased and configured in the Twilio console to point to the server's webhook, followed by a live call to test the transcription and hardcoded response.
#6about 5 minutes
Integrating OpenAI for streaming dynamic responses
The OpenAI API is integrated to generate dynamic responses, using streaming to send text chunks back as they are generated to minimize perceived latency for the caller.
#7about 2 minutes
Adding conversational memory for context-aware replies
A simple map is used to store the last message ID for each WebSocket connection, enabling the OpenAI API to maintain conversational history for follow-up questions.
#8about 2 minutes
Final demo with AI, history, and interruption
The final demonstration showcases the fully functional AI phone agent handling a multi-turn conversation, remembering context, and allowing the user to interrupt its response.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
44:54 MIN
Live coding a phone agent with Twilio and Deno
WeAreDevelopers LIVE – Real-Time Phone Agents, Unsafe VPNs & More
00:17 MIN
Building a custom voice AI with WebRTC and Google APIs
Raise your voice!
38:46 MIN
Live demo of building a chat with your data app
Inside the AI Revolution: How Microsoft is Empowering the World to Achieve More
20:54 MIN
Live code demo of various AI application patterns
Building AI Applications with LangChain and Node.js
06:14 MIN
Demonstrating basic API capabilities with live examples
Building a Browser-Based Karaoke Game with Web Speech API
02:19 MIN
Prototyping a basic AI code review agent
The Limits of Prompting: ArchitectingTrustworthy Coding Agents
03:46 MIN
How the quiz app was built with Vonage APIs
The Vonage Trivia Voyage: Quiz Your Way to the Top!
26:11 MIN
Abstracting voice infrastructure with Twilio Conversation Relay
Hello JARVIS - Building Voice Interfaces for Your LLMS
Featured Partners
Related Videos
WeAreDevelopers LIVE – Real-Time Phone Agents, Unsafe VPNs & More
Chris Heilmann, Daniel Cranney & Marius Obert
Raise your voice!
Lee Boonstra
Livecoding with AI
Rainer Stropek
Hello JARVIS - Building Voice Interfaces for Your LLMS
Nathaniel Okenwa
From ML to LLM: On-device AI in the Browser
Nico Martin
WeAreDevelopers LIVE - Build a multi AI agents game master with Strands & our weekly web finds
Chris Heilmann, Daniel Cranney & Tiffany Souterre
WeAreDevelopers LIVE – AI vs the Web & AI in Browsers
Chris Heilmann, Daniel Cranney & Raymond Camden
WeAreDevelopers LIVE - Is AI replacing developers?, Stopping bots, AI on device & more
Chris Heilmann & Daniel Cranney & Sebastian Gingter
From learning to earning
Jobs that call for the skills explored in this talk.
NodeJS Software Engineer - Conversational AI
MANGO
Palau-solità i Plegamans, Spain
API
Azure
Redis
Node.js
Salesforce
+6
Software Engineer (AI Platform) - Remote
Replika
Municipality of Madrid, Spain
Remote
API
NoSQL
Python
Docker
+4
Machine Learning Engineer (Conversational AI)
Amber Labs Ltd
Charing Cross, United Kingdom
€61K
API
REST
Azure
Django
+6
AI Infra Engineer - Serverless AI
Project People
Edinburgh, United Kingdom
C++
Python
Docker
Kubernetes
Machine Learning
Senior Software Engineer, AI Model serving (Europe)
Speechify
Municipality of Madrid, Spain
Remote
Azure
Python
Docker
Kubernetes
+1


