Andreas Erben

Aug 20, 2025 • World Congress 2025

You are not my model anymore - understanding LLM model behavior

Your LLM is a shoggoth with a smiley face mask. Learn what happens when the mask slips and your application breaks.

#1about 2 minutes

Unexpected LLM behavior from hidden platform updates

A practical demonstration shows how a cloud provider's content filter update can unexpectedly block access to documents, causing application failures.

#2about 3 minutes

How LLMs generate text and learn behavior

Large language models use a transformer architecture to predict the next token based on probability, with instruction tuning and alignment shaping their final behavior.

#3about 2 minutes

The opaque and complex stack of modern LLM services

Major LLM providers operate in secrecy, and the full technology stack from model weights to the API is complex, leaving developers with limited visibility and control.

#4about 3 minutes

Managing risks from provider filters and short API lifecycles

Cloud provider content filters can change without notice, creating vulnerabilities, while the short lifecycle of model APIs requires constant adaptation.

#5about 4 minutes

Understanding LLMs as alien minds with fragile alignment

LLMs are conceptually like alien intelligences with a fragile, human-like alignment layer that can be bypassed by jailbreaks exploiting internal model circuits.

#6about 2 minutes

How model personalities and behaviors shift between versions

Different LLM versions exhibit distinct behaviors and may ignore system prompts, as shown by a comparison between GPT-4 and a newer reasoning model.

#7about 3 minutes

Using evaluations to systematically test model behavior

Systematically test model behavior using evaluations, which can be automated by generating prompt variations or using pre-built cloud and open-source frameworks.

#8about 4 minutes

Using prompt engineering to mitigate model drift

Mitigate model behavior drift by using advanced prompt engineering techniques like forcing reasoning, providing few-shot examples, and being highly explicit in instructions.

24 days ago

AI Software Engineer (m/f/d)

Sunhat
Köln, Germany

Remote

Senior

1 month ago

Senior Machine Learning Engineer (f/m/d)

MARKT-PILOT GmbH
Stuttgart, Germany

Remote

Senior

10 days ago

Lead Fullstack Engineer AI

Hubert Burda Media
München, Germany

Intermediate

Shifting from traditional code to AI-powered logic

09:55 MIN

Shifting from traditional code to AI-powered logic

WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA

The ethical risks of outdated and insecure AI models

13:54 MIN

The ethical risks of outdated and insecure AI models

AI & Ethics

AI privacy concerns and prompt engineering

25:33 MIN

AI privacy concerns and prompt engineering

Coffee with Developers - Cassidy Williams -

The technical challenges of running LLMs in browsers

09:43 MIN

The technical challenges of running LLMs in browsers

From ML to LLM: On-device AI in the Browser

The limitations and potential of AI models

20:05 MIN

The limitations and potential of AI models

Coffee with Developers - Cassidy Williams -

The danger of over-engineering with LLMs

16:53 MIN

The danger of over-engineering with LLMs

Event-Driven Architecture: Breaking Conversational Barriers with Distributed AI Agents

The rapid adoption of LLMs outpaces security practices

00:03 MIN

The rapid adoption of LLMs outpaces security practices

ChatGPT, ignore the above instructions! Prompt injection attacks and how to avoid them.

Final thoughts on developer accountability and AI tooling

27:27 MIN

Final thoughts on developer accountability and AI tooling

Vibe coding sucks! Long life to vibe coding: Hardening Applications for Production with GenAI

Featured Partners

Three years of putting LLMs into Software - Lessons learned

Three years of putting LLMs into Software - Lessons learned

Simon A.T. Jiménez

about 2 months ago • World Congress 2025

Beyond the Hype: Building Trustworthy and Reliable LLM Applications with Guardrails

Beyond the Hype: Building Trustworthy and Reliable LLM Applications with Guardrails

Alex Soto

about 2 months ago • World Congress 2025

AI: Superhero or Supervillain? How and Why with Scott Hanselman

AI: Superhero or Supervillain? How and Why with Scott Hanselman

Scott Hanselman

about a year ago • World Congress 2024

How AI Models Get Smarter

How AI Models Get Smarter

Ankit Patel

about 3 months ago • World Congress 2025

Prompt Injection, Poisoning & More: The Dark Side of LLMs

Prompt Injection, Poisoning & More: The Dark Side of LLMs

Keno Dreßel

about 2 months ago • World Congress 2025

Inside the Mind of an LLM

Inside the Mind of an LLM

Emanuele Fabbiani

about 2 months ago • World Congress 2025

From Traction to Production: Maturing your GenAIOps step by step

From Traction to Production: Maturing your GenAIOps step by step

Maxim Salnikov

about 2 months ago • World Congress 2025

Bringing the power of AI to your application.

Bringing the power of AI to your application.

Krzysztof Cieślak

about a year ago • World Congress 2024

From learning to earning

Jobs that call for the skills explored in this talk.

Senior Researcher for Generative AI

30 days ago

Senior Researcher for Generative AI

Dynatrace
Linz, Austria

Senior

AI Frameworks

AI Software Engineer - Model Evaluation

today

AI Software Engineer - Model Evaluation

Aleph Alpha

PyTorch

AI Engineer / Machine Learning Engineer / KI-Entwickler - Schwerpunkt Cloud & MLOps

today

AI Engineer / Machine Learning Engineer / KI-Entwickler - Schwerpunkt Cloud & MLOps

Agenda GmbH

Intermediate

API

Azure

Python

Docker

PyTorch

+9

AI/ Machine Learning Consultant *

today

AI/ Machine Learning Consultant *

XL2 GmbH

Remote

NumPy

Python

Pandas

PyTorch

+5

AIML -Machine Learning Research, DMLI

today

AIML -Machine Learning Research, DMLI

Apple

Python

PyTorch

TensorFlow

Machine Learning

Natural Language Processing

ML Engineer - MLOps/Data Focus

today

ML Engineer - MLOps/Data Focus

Baunex

Remote

ETL

GIT

Java

Kafka

+8

Net Engineer with AI Focus

today

Net Engineer with AI Focus

Speech Processing Solutions

Remote

€65K

Intermediate

GIT

.NET

REST

+10

AI Model Training & Refinement Specialist - AI

today

AI Model Training & Refinement Specialist - AI

FDTech GmbH

R

GIT

Python

A/B testing

Machine Learning

+1

Cloud Engineer - AI Platform (LLM)

today

Cloud Engineer - AI Platform (LLM)

idealo internet GmbH

Senior

Azure

Python

Node.js

Terraform

TypeScript

+2