Building Agents Securely at Scale - Alfonso Graziano

What's the difference between a toy AI agent and a production-ready one? Learn to build secure, scalable systems that can handle the chaos of real-world users.

#1about 2 minutes

The gap between simple tutorials and production AI agents

Tutorials often show a simplistic happy path, but real-world agents require robust systems to handle diverse and unexpected user queries.

#2about 3 minutes

Essential components for building production-ready agents

Productionizing an agent requires moving beyond basic prompts to include evaluations, golden datasets, tracing, user feedback loops, and security guardrails.

#3about 1 minute

Unique security challenges of non-deterministic LLMs

Unlike deterministic software, LLM-based agents can be tricked through new attack vectors like prompt injection, requiring specialized security considerations.

#4about 2 minutes

Key resources for learning agent development

Recommended learning materials include understanding LLM fundamentals, reading "Agentic Design Patterns," and focusing on building proper evaluations and golden datasets.

#5about 4 minutes

The evolving role of developers with AI agents

Developers are shifting from writing all code to acting as tech leads for their own teams of agents, requiring skills in review, direction, and oversight.

#6about 3 minutes

Security risks of running AI agents locally

Giving agents like OpenClau unrestricted access to a local machine is risky; it's better to use sandboxed or containerized environments for experimentation.

#7about 2 minutes

Mitigating hallucinations and sycophancy in agents

Hallucinations and sycophancy persist in agents, but they can be mitigated through a combination of model-level improvements and system-level guardrails.

#8about 4 minutes

Learnings from a real-world agent implementation

A case study highlights the importance of building a comprehensive golden dataset, fostering deep collaboration with SMEs, and using a tight user feedback loop for continuous improvement.

#9about 2 minutes

The surprising power of simple system prompts

While models now have built-in capabilities like Chain of Thought, adding simple, domain-specific sentences to a system prompt can still dramatically improve performance.

#10about 7 minutes

Common enterprise use cases for AI agents

Agents are primarily used for intelligent search across large datasets and for automating complex, reproducible human workflows with human-in-the-loop oversight.