Stanislas Girard
Chatbots are going to destroy infrastructures and your cloud bills
#1 about 3 minutes
Comparing web developers and data scientists before GenAI
Before generative AI, web developers focused on CPU-bound tasks and horizontal scaling while data scientists worked with GPU-bound tasks and vast resources.
#2 about 3 minutes
The new AI engineer role and the RAG pipeline
The emergence of the AI engineer role combines web development and data science skills, often applied to building RAG pipelines for data ingestion and querying.
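As a rough illustration of the two halves of a RAG pipeline (ingestion and querying), here is a minimal sketch in Python. It is not taken from the talk: the `ToyRagIndex` class, the `embed` function and the `build_prompt` helper are hypothetical names, and the bag-of-words "embedding" is a stand-in for a real embedding model, which would normally run on a GPU or behind an API.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ToyRagIndex:
    """Hypothetical in-memory index; a real system would use a vector store."""

    def __init__(self, chunk_size: int = 8) -> None:
        self.chunk_size = chunk_size
        self.chunks: list[tuple[str, Counter]] = []

    def ingest(self, document: str) -> None:
        """Ingestion: split the document into fixed-size chunks and embed each."""
        words = document.split()
        for start in range(0, len(words), self.chunk_size):
            chunk = " ".join(words[start:start + self.chunk_size])
            self.chunks.append((chunk, embed(chunk)))

    def query(self, question: str, top_k: int = 1) -> list[str]:
        """Querying: return the top_k chunks most similar to the question."""
        q = embed(question)
        ranked = sorted(self.chunks, key=lambda c: cosine(q, c[1]), reverse=True)
        return [text for text, _ in ranked[:top_k]]


def build_prompt(question: str, context: list[str]) -> str:
    """Assemble the text that would be sent to an LLM (the call itself is omitted)."""
    joined = "\n".join(context)
    return f"Context:\n{joined}\n\nQuestion: {question}"
```

Ingestion and querying have very different resource profiles in practice: ingestion is batch-like and heavy, while querying sits on the user's request path.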
#3 about 2 minutes
Key architectural challenges in building GenAI apps
Generative AI applications face unique architectural problems, including long response times, sequential bottlenecks, and the difficulty of mixing CPU and GPU-bound processes.
#4 about 3 minutes
How a simple chatbot evolves into a large monolith
Adding features like document ingestion and web scraping to a simple chatbot can rapidly increase its resource consumption and Docker image size, creating a complex monolith.
#5 about 4 minutes
Refactoring a monolithic AI app into a service architecture
To manage complexity and cost, a monolithic AI application should be refactored by separating user-facing logic from heavy background tasks into distinct, independently scalable services.
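One common way to separate user-facing logic from heavy background work is to put a queue between them. The sketch below is an assumption about what such a split could look like, not the speaker's implementation: `handle_upload`, `heavy_ingest` and `worker` are hypothetical names, and the standard-library `queue.Queue` plus a thread stand in for a real message broker and a separately deployed worker service.

```python
import queue
import threading

# In-process stand-in for a message broker between the two services.
jobs: queue.Queue = queue.Queue()
# Stand-in for a shared results store (for example a database).
results: dict[str, str] = {}


def heavy_ingest(doc_id: str) -> str:
    """Placeholder for the expensive part: parsing, chunking, embedding."""
    return f"indexed:{doc_id}"


def worker() -> None:
    """Background service: pulls jobs until it receives the None sentinel."""
    while True:
        doc_id = jobs.get()
        try:
            if doc_id is None:
                return
            results[doc_id] = heavy_ingest(doc_id)
        finally:
            jobs.task_done()


def handle_upload(doc_id: str) -> dict:
    """User-facing service: enqueue the job and answer immediately."""
    jobs.put(doc_id)
    return {"status": "accepted", "doc_id": doc_id}
```

Because the request handler only enqueues, it stays light and can scale horizontally on cheap instances, while the number of workers can be tuned to the backlog.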
#6 about 3 minutes
Choosing the right architecture for your application's workload
A monolithic architecture is suitable for low or continuous workloads, while a service-based approach is necessary for applications with high or spiky traffic to manage costs and scale effectively.
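The cost argument can be made concrete with a back-of-the-envelope model. This is a simplified sketch under stated assumptions, not figures from the talk: the load profiles, the capacity of 50 requests per instance and the price of 1.0 per instance-hour are all hypothetical, and the model ignores the operational overhead that a service-based setup adds.

```python
import math


def always_on_cost(hourly_load: list[int], capacity: int, price: float) -> float:
    """Monolith sized for peak load and kept running for every hour."""
    instances = math.ceil(max(hourly_load) / capacity)
    return instances * price * len(hourly_load)


def autoscaled_cost(hourly_load: list[int], capacity: int, price: float,
                    min_instances: int = 1) -> float:
    """Independently scaled service: pay only for the instances each hour needs."""
    total = 0.0
    for load in hourly_load:
        instances = max(min_instances, math.ceil(load / capacity))
        total += instances * price
    return total


# Hypothetical 24-hour load profiles (requests per hour).
steady = [100] * 24
spiky = [10] * 23 + [200]
```

With these assumed numbers, the steady profile costs 48.0 either way, while the spiky profile costs 96.0 always-on versus 27.0 autoscaled. That matches the claim above: the split only pays off when traffic is uneven.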
#7 about 2 minutes
Overlooked challenges of running AI applications in production
Beyond core architecture, running AI in production involves complex challenges like managing GPUs on Kubernetes, model versioning, data compliance, and testing non-deterministic outputs.
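Testing non-deterministic outputs usually means checking properties of an answer over several samples instead of comparing against one exact string. The following is a minimal sketch of that idea, not a method described in the talk: `fake_llm` is a hypothetical stand-in for a real model call, and the checks (required keywords, a word limit) are illustrative assumptions.

```python
import random


def fake_llm(prompt: str, rng: random.Random) -> str:
    """Hypothetical stand-in for a model: same meaning, varying wording."""
    templates = [
        "The capital of France is Paris.",
        "Paris is the capital of France.",
        "It's Paris.",
    ]
    return rng.choice(templates)


def passes_checks(answer: str, must_contain: list[str], max_words: int) -> bool:
    """Property checks: required keywords present and answer short enough."""
    text = answer.lower()
    has_keywords = all(keyword.lower() in text for keyword in must_contain)
    return has_keywords and len(answer.split()) <= max_words


def pass_rate(prompt: str, must_contain: list[str], max_words: int,
              samples: int = 20, seed: int = 0) -> float:
    """Sample the model several times and report the fraction that passes."""
    rng = random.Random(seed)
    passed = sum(
        passes_checks(fake_llm(prompt, rng), must_contain, max_words)
        for _ in range(samples)
    )
    return passed / samples
```

A test suite would then assert that the pass rate stays above a chosen threshold, so that harmless rewording does not fail the build while a wrong answer does.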
#8 about 2 minutes
Using creative evaluations and starting with small models
A creative evaluation using a game like Street Fighter reveals that smaller, faster LLMs can outperform larger ones for many use cases, making them a better starting point.