Nimrod Kor
The Limits of Prompting: ArchitectingTrustworthy Coding Agents
#1about 2 minutes
Prototyping a basic AI code review agent
A simple prototype using a GitHub webhook and a single LLM call reveals the potential for understanding code semantics beyond static analysis.
#2about 2 minutes
Iteratively improving prompts to handle edge cases
Simple prompts fail to consider developer comments or model knowledge cutoffs, requiring more detailed instructions to improve accuracy.
#3about 5 minutes
Establishing a robust benchmarking process for agents
A reliable benchmarking pipeline uses a large dataset, concurrent execution, and an LLM-as-a-judge (LLJ) to measure and track performance improvements.
#4about 2 minutes
Decomposing large tasks into specialized agents
To combat inconsistency and hallucinations, a single large task like code review is broken down into multiple smaller, specialized agents.
#5about 6 minutes
Leveraging codebase context for deeper insights
Moving beyond prompts, providing codebase context via vector similarity (RAG) and module dependency graphs (AST) unlocks high-quality, human-like feedback.
#6about 3 minutes
Introducing Awesome Reviewers for community standards
Awesome Reviewers is a collection of prompts derived from open-source projects that can be used to enforce team-specific coding standards.
#7about 1 minute
Key takeaways for building reliable LLM agents
The path to a reliable agent involves starting with a proof-of-concept, benchmarking rigorously, using prompt engineering for quick fixes, and investing in deep context.
Related jobs
Jobs that call for the skills explored in this talk.
Wilken GmbH
Ulm, Germany
Senior
Amazon Web Services (AWS)
Kubernetes
+1
Matching moments
03:29 MIN
The evolution from prompt engineering to context engineering
Engineering Productivity: Cutting Through the AI Noise
05:55 MIN
How to structure prompts and specifications for LLMs
Building and Modernising Apps with Agentic AI - Julia Kordick
04:43 MIN
The limitations and frustrations of coding with LLMs
WAD Live 22/01/2025: Exploring AI, Web Development, and Accessibility in Tech with Stefan Judis
03:31 MIN
Effective prompting and defensive coding for LLMs
Lessons Learned Building a GenAI Powered App
02:27 MIN
An overview of an AI-powered code reviewer
How we built an AI-powered code reviewer in 80 hours
01:55 MIN
Leveraging AI tools to correct code and text
How to Survive with Dyslexia as a Developer
02:33 MIN
Why you need to prompt large language models like a child
Developers vs Scammers, Bad Design, AI is Pointless, AJAX is 20 and more - The Best of LIVE 2025 - Part 1
02:58 MIN
Shifting from traditional code to AI-powered logic
WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA
Featured Partners
Related Videos
How we built an AI-powered code reviewer in 80 hours
Yan Cui
Three years of putting LLMs into Software - Lessons learned
Simon A.T. Jiménez
Using LLMs in your Product
Daniel Töws
Bringing the power of AI to your application.
Krzysztof Cieślak
The AI Agent Path to Prod: Building for Reliability
Max Tkacz
Prompt Engineering - an Art, a Science, or your next Job Title?
Maxim Salnikov
New AI-Centric SDLC: Rethinking Software Development with Knowledge Graphs
Gregor Schumacher, Sujay Joshy & Marcel Gocke
Beyond Prompting: Building Scalable AI with Multi-Agent Systems and MCP
Viktoria Semaan
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Wolters Kluwer
Alphen aan den Rijn, Netherlands
Intermediate
Node.js
TypeScript
Cloud (AWS/Google/Azure)

AVEX Automotive GmbH & Co. KG
Hamburg, Germany
Intermediate
React
OpenAI API


Collaboration Betters The World GmbH
API
Azure
Flask
Python
FastAPI
+2

Looma Gmbh
Magdeburg, Germany
API
DevOps
Docker
Kubernetes
Continuous Integration


Mindrift
Remote
£41K
Junior
JSON
Python
Data analysis
+1

Dr. Dienst & Partner GmbH & Co. KG
Frankfurt am Main, Germany
API
Azure
Python

ami Consulting
Canton of Neuilly-sur-Seine, France
Remote
Senior
API
GIT
Java
REST
+19