Nimrod Kor

Aug 20, 2025 • World Congress 2025

The Limits of Prompting: ArchitectingTrustworthy Coding Agents

Prompt engineering has its limits. Learn how a multi-agent architecture, enriched with deep context, boosted our AI agent's suggestion acceptance rate from 12% to over 60%.

#1about 2 minutes

Prototyping a basic AI code review agent

A simple prototype using a GitHub webhook and a single LLM call reveals the potential for understanding code semantics beyond static analysis.

#2about 2 minutes

Iteratively improving prompts to handle edge cases

Simple prompts fail to consider developer comments or model knowledge cutoffs, requiring more detailed instructions to improve accuracy.

#3about 5 minutes

Establishing a robust benchmarking process for agents

A reliable benchmarking pipeline uses a large dataset, concurrent execution, and an LLM-as-a-judge (LLJ) to measure and track performance improvements.

#4about 2 minutes

Decomposing large tasks into specialized agents

To combat inconsistency and hallucinations, a single large task like code review is broken down into multiple smaller, specialized agents.

#5about 6 minutes

Leveraging codebase context for deeper insights

Moving beyond prompts, providing codebase context via vector similarity (RAG) and module dependency graphs (AST) unlocks high-quality, human-like feedback.

#6about 3 minutes

Introducing Awesome Reviewers for community standards

Awesome Reviewers is a collection of prompts derived from open-source projects that can be used to enforce team-specific coding standards.

#7about 1 minute

Key takeaways for building reliable LLM agents

The path to a reliable agent involves starting with a proof-of-concept, benchmarking rigorously, using prompt engineering for quick fixes, and investing in deep context.

2 days ago

AI Software Engineer (m/f/d)

Sunhat
Köln, Germany

Remote

Senior

14 days ago

Senior Machine Learning Engineer (f/m/d)

MARKT-PILOT GmbH
Stuttgart, Germany

Remote

Senior

7 days ago

Senior AI Software Developer & Mentor

Dynatrace
Linz, Austria

Senior

Featured Partners

How we built an AI-powered code reviewer in 80 hours

How we built an AI-powered code reviewer in 80 hours

Yan Cui

about 2 months ago • World Congress 2025

Three years of putting LLMs into Software - Lessons learned

Three years of putting LLMs into Software - Lessons learned

Simon A.T. Jiménez

about 2 months ago • World Congress 2025

The AI Agent Path to Prod: Building for Reliability

The AI Agent Path to Prod: Building for Reliability

Max Tkacz

about 2 months ago • World Congress 2025

Prompt Engineering - an Art, a Science, or your next Job Title?

Prompt Engineering - an Art, a Science, or your next Job Title?

Maxim Salnikov

about a year ago • World Congress 2024

Bringing the power of AI to your application.

Bringing the power of AI to your application.

Krzysztof Cieślak

about a year ago • World Congress 2024

Beyond Prompting: Building Scalable AI with Multi-Agent Systems and MCP

Beyond Prompting: Building Scalable AI with Multi-Agent Systems and MCP

Viktoria Semaan

about 2 months ago • World Congress 2025

AI: Superhero or Supervillain? How and Why with Scott Hanselman

AI: Superhero or Supervillain? How and Why with Scott Hanselman

Scott Hanselman

about a year ago • World Congress 2024

Using LLMs in your Product

Using LLMs in your Product

Daniel Töws

about a year ago • World Congress 2024

From learning to earning

Jobs that call for the skills explored in this talk.

Senior Backend Engineer – AI Integration (m/w/x)

1 month ago

Senior Backend Engineer – AI Integration (m/w/x)

chatlyn GmbH
Vienna, Austria

Senior

JavaScript

AI-assisted coding tools

5 days ago

Agentic AI Architect - Python, LLMs & NLP

FRG Technology Consulting

Intermediate

Azure

Python

Machine Learning

5 days ago

Security-by-Design for Trustworthy Machine Learning Pipelines

Association Bernard Gregory

Machine Learning

Continuous Delivery

yesterday

AI/ML Team Lead - Generative AI (LLMs, AWS)

Provectus
Canton de Saint-Mihiel, France

Remote

€96K

Senior

Python

PyTorch

TensorFlow

+4

5 days ago

AI/ML Team Lead - Generative AI (LLMs, AWS)

Provectus
Canton de Saint-Mihiel, France

Remote

€96K

Senior

Python

PyTorch

TensorFlow

+4

5 days ago

AI Evaluation Data Scientist - AI/ML/LLM - (Hybrid (Hybrid) - Barcelona

European Tech Recruit
Barcelona, Spain

Intermediate

GIT

Python

Pandas

Docker

PyTorch

+2

5 days ago

Machine Learning Scientist (AI for Code)

SonarSource
Bochum, Germany

Java

Python

PyTorch

TensorFlow

Machine Learning

+1

5 days ago

Full-Stack & LLM Engineer

Neural Concept
Lausanne, Switzerland

Python

Machine Learning