From Hallucination to Justification: Hands-On Explainability for LLMs

About This Session

Human beings are biased and often wrong. Artificial intelligence learns from human-created data. Therefore, artificial intelligence is biased and often wrong. This has been a critical problem across machine learning applications in the last years. To break open the black box of AI models, and understand how they make decisions, the concept of explainability was introduced. Then, LLMs entered the chat. They answer our questions confidently and with a beautiful prose, even when they are making up data. Explainability then becomes essential to trust -or not- their output. But when the existing explainable AI methods cannot be directly applied to these models, what do we do? In this workshop, we will delve into the topic of explainable AI, and its importance in the current context of LLMs and agents. Starting from traditional machine learning to then focus on LLMs, we will cover the different methods that can be implemented, from well-known ones to novel proposals stemming from our internal research. We will also introduce research-proven prompting strategies, tips, and tricks to integrate explanations on third-party LLM services that are not natively explainable. Through guided exercises, you will get to peek under the hood of AI models, LLMs' behavior, and agents reasoning, by trying out these different techniques, and seeing their benefits and limitations first-hand. You will experience the risks and challenges that generative AI and agentic AI bring when implementing explainability, and learn practical ways to tackle them. By the end of the workshop, you will leave with a mental toolkit you can apply immediately to know: - When to trust (or distrust) LLMs - Which explainability capabilities to implement (and how) on your RAG systems and LLM-based workflows (or which ones to look for when choosing a third-party service), and - How to make LLMs' behavior more predictable, transparent, and ultimately safe

Speakers

Lucía Conde-Moreno

Software Engineer · Info Support

Software Engineer at Info Support

Read bio

Lucía Conde-Moreno is Head of the AI Research Center at Info Support, where she also works as a consultant software engineer specializing in data and AI applications. She is known as a Jack Of All Trades by her colleagues, having worked in varied roles ranging from .NET or Java developer to data scientist or machine learning engineer. She has worked for different national and international clients, in diverse fields such as finance, health care, energy, or education. She is part of the AI Champions chapter for promoting AI-augmented engineering tools, and she is responsible for supervising internal research in subfields of AI like explainability or computer vision. When she is not working, she is busy switching across random hobbies, from filmmaking to DJing. She holds a MSc in Computer Science, and a BSc in Telecommunications Engineering.

Tessel Haagen

Consultant Data AI · Info Support

Consultant Data AI at Info Support

Read bio

Tessel is a consultant in data & AI at Info Support, specialized in interpretable AI. In her daily work, she applies AI-augmented engineering to build and evaluate intelligent systems. She is known as an enthusiastic knowledge source and an insatiably curious lifelong learner. Outside of work, she channels her strategic thinking into board games (with a soft spot for complex strategy games) and escapes into fantasy novels. With an MSc in Artificial Intelligence, an MSc in Computer Science, and an MA in Linguistics, she combines a strong technical foundation with a deep interest in language and reasoning.