Stanislas Girard
Chatbots are going to destroy infrastructures and your cloud bills
#1about 3 minutes
Comparing web developers and data scientists before GenAI
Before generative AI, web developers focused on CPU-bound tasks and horizontal scaling while data scientists worked with GPU-bound tasks and vast resources.
#2about 3 minutes
The new AI engineer role and the RAG pipeline
The emergence of the AI engineer role combines web development and data science skills, often applied to building RAG pipelines for data ingestion and querying.
#3about 2 minutes
Key architectural challenges in building GenAI apps
Generative AI applications face unique architectural problems, including long response times, sequential bottlenecks, and the difficulty of mixing CPU and GPU-bound processes.
#4about 3 minutes
How a simple chatbot evolves into a large monolith
Adding features like document ingestion and web scraping to a simple chatbot can rapidly increase its resource consumption and Docker image size, creating a complex monolith.
#5about 4 minutes
Refactoring a monolithic AI app into a service architecture
To manage complexity and cost, a monolithic AI application should be refactored by separating user-facing logic from heavy background tasks into distinct, independently scalable services.
#6about 3 minutes
Choosing the right architecture for your application's workload
A monolithic architecture is suitable for low or continuous workloads, while a service-based approach is necessary for applications with high or spiky traffic to manage costs and scale effectively.
#7about 2 minutes
Overlooked challenges of running AI applications in production
Beyond core architecture, running AI in production involves complex challenges like managing GPUs on Kubernetes, model versioning, data compliance, and testing non-deterministic outputs.
#8about 2 minutes
Using creative evaluations and starting with small models
A creative evaluation using a game like Street Fighter reveals that smaller, faster LLMs can outperform larger ones for many use cases, making them a better starting point.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
26:27 MIN
Managing the rapid pace of AI development and its impact
From Monolith Tinkering to Modern Software Development
00:05 MIN
Introduction to large-scale AI infrastructure challenges
Your Next AI Needs 10,000 GPUs. Now What?
26:01 MIN
Using generative AI to enhance developer productivity
Throwing off the burdens of scale in engineering
07:56 MIN
How generative AI is changing software development
The transformative impact of GenAI for software development and its implications for cybersecurity
14:40 MIN
The impact of ChatGPT and the rise of chat interfaces
Innovating Developer Tools with AI: Insights from GitHub Next
02:55 MIN
Positioning generative AI as the next major technology shift
The Data Phoenix: The future of the Internet and the Open Web
10:03 MIN
Overcoming the key challenges of building with GenAI
The State of GenAI & Machine Learning in 2025
00:57 MIN
Navigating the overwhelming wave of generative AI adoption
Developer Experience, Platform Engineering and AI powered Apps
Featured Partners
Related Videos
Should we build Generative AI into our existing software?
Simon Müller
AI: Superhero or Supervillain? How and Why with Scott Hanselman
Scott Hanselman
How AI Models Get Smarter
Ankit Patel
Make it simple, using generative AI to accelerate learning
Duan Lightfoot
Livecoding with AI
Rainer Stropek
Using LLMs in your Product
Daniel Töws
Bringing the power of AI to your application.
Krzysztof Cieślak
Supercharge your cloud-native applications with Generative AI
Cedric Clyburn
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.


Generative AI Engineer | Cloud & Leadership
Descripción De La Vacante
€40-70K
Azure
Python
PyTorch
TensorFlow
+1


AI/ML Data Scientist: Build & Deploy Generative Models
UL Solutions
Barcelona, Spain
Python
Machine Learning

FULLREMOTE- Chatbot Developer / AI Engineer
Capitole
Barcelona, Spain
Remote
C++
Python
PyTorch
TensorFlow
+3


Agentic AI Architect - Python, LLMs & NLP
FRG Technology Consulting
Intermediate
Azure
Python
Machine Learning

FULLREMOTE- Chatbot Developer / AI Engineer
Capitole
Municipality of Vitoria-Gasteiz, Spain
Remote
C++
Python
PyTorch
TensorFlow
+3

FULLREMOTE- Chatbot Developer / AI Engineer
Capitole
Municipality of Girona, Spain
Remote
C++
Python
PyTorch
TensorFlow
+3