Building Agents Securely at Scale - Alfonso Graziano
What's the difference between a toy AI agent and a production-ready one? Learn to build secure, scalable systems that can handle the chaos of real-world users.
#1about 2 minutes
The gap between simple tutorials and production AI agents
Tutorials often show a simplistic happy path, but real-world agents require robust systems to handle diverse and unexpected user queries.
#2about 3 minutes
Essential components for building production-ready agents
Productionizing an agent requires moving beyond basic prompts to include evaluations, golden datasets, tracing, user feedback loops, and security guardrails.
#3about 1 minute
Unique security challenges of non-deterministic LLMs
Unlike deterministic software, LLM-based agents can be tricked through new attack vectors like prompt injection, requiring specialized security considerations.
#4about 2 minutes
Key resources for learning agent development
Recommended learning materials include understanding LLM fundamentals, reading "Agentic Design Patterns," and focusing on building proper evaluations and golden datasets.
#5about 4 minutes
The evolving role of developers with AI agents
Developers are shifting from writing all code to acting as tech leads for their own teams of agents, requiring skills in review, direction, and oversight.
#6about 3 minutes
Security risks of running AI agents locally
Giving agents like OpenClau unrestricted access to a local machine is risky; it's better to use sandboxed or containerized environments for experimentation.
#7about 2 minutes
Mitigating hallucinations and sycophancy in agents
Hallucinations and sycophancy persist in agents, but they can be mitigated through a combination of model-level improvements and system-level guardrails.
#8about 4 minutes
Learnings from a real-world agent implementation
A case study highlights the importance of building a comprehensive golden dataset, fostering deep collaboration with SMEs, and using a tight user feedback loop for continuous improvement.
#9about 2 minutes
The surprising power of simple system prompts
While models now have built-in capabilities like Chain of Thought, adding simple, domain-specific sentences to a system prompt can still dramatically improve performance.
#10about 7 minutes
Common enterprise use cases for AI agents
Agents are primarily used for intelligent search across large datasets and for automating complex, reproducible human workflows with human-in-the-loop oversight.
Related jobs
Jobs that call for the skills explored in this talk.
Dev Digest 217: All About Cookies, Better Agents & How OpenAI Built CodexInside last week’s Dev Digest 217 .
🍪 All you ever wanted to know about cookies
🤖 Why ChatGPT cites one page over another
🛠️ How to build better agents
🌐 Is your site agent ready?
🔐 Get started on GitHub Actions security
🧠 How OpenAI built Codex
🧹 G...
Daniel Cranney
Dev Digest 211: Securing Agents, Top AI Apps and Lost Readers…Inside last week’s Dev Digest 211 .
🏗️ Can the infrastructure keep up with AI growth?
📱 Top 100 GenAI consumer apps
🪱 Wikipedia hit by worm and AI slop
🔍 The results of Codex Security scanning 1.2M commits
🧹 Bye bye innerHTML, welcome setHTML()
🔄 Cl...
Dev Digest 210: AI Agents Are Go! Is MCP Dead? LLMs Crack AnonymityInside last week’s Dev Digest 210 .
🪦 Is MCP already dead?
🐍 Secure snake on the CLI
🏗️ The architecture behind open source LLMs
⚖️ AI companies and governments at odds
🦫 Is Go the best language for AI agents?
🕵️ “Security research” bot hacks Micros...
From learning to earning
Jobs that call for the skills explored in this talk.