Carl Lapierre
Carl Lapierre - Exploring Advanced Patterns in Retrieval-Augmented Generation
#1about 4 minutes
Understanding the basic RAG pipeline and its limitations
The standard retrieval-augmented generation pipeline is reviewed, highlighting common business needs like explainability and accuracy that require more advanced solutions.
#2about 3 minutes
Improving accuracy with advanced search and data preparation
Techniques like hybrid search, post-retrieval reranking, and recursive data summarization with Raptor are used to enhance retrieval accuracy.
#3about 3 minutes
Introducing agentic RAG for complex reasoning tasks
Agentic RAG systems add reasoning capabilities to basic pipelines by incorporating tools, memory, planning, and reflection to work towards a goal.
#4about 2 minutes
Implementing self-critique with the corrective RAG pattern
The corrective RAG pattern improves reliability by adding a grading step to evaluate retrieved documents for relevance before generating a response.
#5about 1 minute
Expanding search with query translation and RAG fusion
RAG fusion rewrites a single user query from multiple perspectives to cast a wider net and improve the chances of finding relevant information.
#6about 1 minute
Enabling actions with tool use and function calling
Providing an LLM with a defined set of tools, such as vector search or a calculator, allows it to perform specific actions beyond text generation.
#7about 3 minutes
Orchestrating tasks with advanced planning techniques
Planning evolves from simple routing to complex, parallel execution using directed acyclic graphs (DAGs) generated by an LLM compiler.
#8about 1 minute
Exploring experimental multi-agent collaboration frameworks
Hierarchical multi-agent systems create a separation of concerns by allowing specialized atomic agents to delegate tasks and collaborate on complex queries.
#9about 2 minutes
Key considerations for deploying RAG systems in production
Successfully deploying RAG requires managing token costs and latency, implementing guardrails, ensuring data quality, and being aware of the unreliability of advanced planning.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
07:24 MIN
Introducing retrieval-augmented generation (RAG)
Martin O'Hanlon - Make LLMs make sense with GraphRAG
15:49 MIN
Understanding retrieval-augmented generation (RAG)
Exploring LLMs across clouds
15:24 MIN
Implementing the Retrieval-Augmented Generation (RAG) pattern
Develop AI-powered Applications with OpenAI Embeddings and Azure Search
06:05 MIN
Understanding Retrieval-Augmented Generation (RAG)
Graphs and RAGs Everywhere... But What Are They? - Andreas Kollegger - Neo4j
01:32 MIN
How RAG provides LLMs with up-to-date context
How to scrape modern websites to feed AI agents
13:21 MIN
Implementing retrieval-augmented generation for documents
Semantic AI: Why Embeddings Might Matter More Than LLMs
35:15 MIN
Advanced techniques like RAG, function calling, and fine-tuning
Creating Industry ready solutions with LLM Models
08:01 MIN
How RAG solves LLM limitations
Building Blocks of RAG: From Understanding to Implementation
Featured Partners
Related Videos
Building Blocks of RAG: From Understanding to Implementation
Ashish Sharma
Build RAG from Scratch
Phil Nash
Large Language Models ❤️ Knowledge Graphs
Michael Hunger
RAG like a hero with Docling
Alex Soto & Markus Eisele
Building Real-Time AI/ML Agents with Distributed Data using Apache Cassandra and Astra DB
Dieter Flick
Accelerating GenAI Development: Harnessing Astra DB Vector Store and Langflow for LLM-Powered Apps
Dieter Flick & Michel de Ru
Martin O'Hanlon - Make LLMs make sense with GraphRAG
Martin O'Hanlon
Graphs and RAGs Everywhere... But What Are They? - Andreas Kollegger - Neo4j
From learning to earning
Jobs that call for the skills explored in this talk.

Cloud & Reliability Engineer - Fokus Machine Learning & Generative AI
Barmer
Bash
Docker
Grafana
Terraform
Prometheus
+4

Machine Learning Engineer - AI-Features, Agenten & Multi Agenten Workflows
dmrz
API
.NET
DevOps
Python
Microservices
+2





AIML -Machine Learning Research, DMLI
Apple
Python
PyTorch
TensorFlow
Machine Learning
Natural Language Processing

Machine Learning Algorithm/SW Optimization Engineer
Leuven MindGate
Python
PyTorch
TensorFlow
Machine Learning

Data Scientist- Python/MLflow-NLP/MLOps/Generative AI
ITech Consult AG
Azure
Python
PyTorch
TensorFlow
Machine Learning