Jodie Burchell
Lies, Damned Lies and Large Language Models
#1about 2 minutes
Understanding the dual nature of large language models
LLMs can generate both creative, coherent text and factually incorrect "hallucinations," posing a significant challenge for real-world applications.
#2about 4 minutes
The architecture and evolution of LLMs
The combination of the scalable Transformer architecture and massive text datasets enables models like GPT to develop "parametric knowledge" as they grow in size.
#3about 3 minutes
How training data quality influences model behavior
The quality of web-scraped datasets like Common Crawl, even after filtering, directly contributes to model hallucinations by embedding misinformation.
#4about 2 minutes
Differentiating between faithfulness and factuality hallucinations
Hallucinations are categorized as either faithfulness errors, which contradict a given source text, or factuality errors, which stem from incorrect learned knowledge.
#5about 3 minutes
Using the TruthfulQA dataset to measure misinformation
The TruthfulQA dataset provides a benchmark for measuring an LLM's tendency to repeat common misconceptions and conspiracy theories across various categories.
#6about 6 minutes
A practical guide to benchmarking LLM hallucinations
A step-by-step demonstration shows how to use Python, LangChain, and Hugging Face Datasets to run the TruthfulQA benchmark on a model like GPT-3.5 Turbo.
#7about 4 minutes
Exploring strategies to reduce LLM hallucinations
Key techniques to mitigate hallucinations include careful prompt crafting, domain-specific fine-tuning, output evaluation, and retrieval-augmented generation (RAG).
#8about 4 minutes
A deep dive into retrieval-augmented generation
RAG reduces hallucinations by augmenting prompts with relevant, up-to-date information retrieved from a vector database of document embeddings.
#9about 2 minutes
Overcoming challenges with advanced RAG techniques
Naive RAG can fail due to poor retrieval or generation, but advanced methods like Rowan selectively apply retrieval to significantly improve factuality.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
00:02 MIN
Understanding the problem of LLM hallucinations
Martin O'Hanlon - Make LLMs make sense with GraphRAG
04:08 MIN
Explaining how large language models work and why they hallucinate
Innovating Developer Tools with AI: Insights from GitHub Next
00:05 MIN
Why web data is essential for training large language models
How to scrape modern websites to feed AI agents
09:30 MIN
The challenge of correctness and model hallucination
The shadows that follow the AI generative models
23:53 MIN
Dealing with different types of LLM hallucinations
How we built an AI-powered code reviewer in 80 hours
06:11 MIN
Key challenges of LLMs like hallucination
Building Blocks of RAG: From Understanding to Implementation
22:28 MIN
Navigating the common challenges of building with LLMs
Creating Industry ready solutions with LLM Models
44:41 MIN
Q&A on embedding calculation, ethics, and tooling
Develop AI-powered Applications with OpenAI Embeddings and Azure Search
Featured Partners
Related Videos
Creating Industry ready solutions with LLM Models
Vijay Krishan Gupta & Gauravdeep Singh Lotey
Inside the Mind of an LLM
Emanuele Fabbiani
How to Avoid LLM Pitfalls - Mete Atamel and Guillaume Laforge
Meta Atamel & Guillaume Laforge
Large Language Models ❤️ Knowledge Graphs
Michael Hunger
Martin O'Hanlon - Make LLMs make sense with GraphRAG
Martin O'Hanlon
Three years of putting LLMs into Software - Lessons learned
Simon A.T. Jiménez
Give Your LLMs a Left Brain
Stephen Chin
Using LLMs in your Product
Daniel Töws
From learning to earning
Jobs that call for the skills explored in this talk.


Senior AI Software Developer & Mentor
Dynatrace
Linz, Austria
Senior
Java
TypeScript
AI Frameworks
Agile Methodologies

AIML -Machine Learning Research, DMLI
Apple
Python
PyTorch
TensorFlow
Machine Learning
Natural Language Processing


Data Scientist- Python/MLflow-NLP/MLOps/Generative AI
ITech Consult AG
Azure
Python
PyTorch
TensorFlow
Machine Learning



