Phil Nash

Aug 20, 2024 • World Congress 2024

Build RAG from Scratch

You don't need complex tools to start with RAG. This session builds a surprisingly effective system from scratch using basic vectorization and cosine similarity.

#1about 3 minutes

Why large language models need retrieval augmented generation

Large language models have knowledge cutoffs and lack access to private data, a problem solved by providing relevant context at query time using RAG.

#2about 1 minute

How similarity search and vector embeddings power RAG

RAG relies on similarity search, not keyword search, which captures meaning by converting text into numerical representations called vector embeddings.

#3about 6 minutes

Building a simple bag-of-words vectorizer from scratch

A basic vector embedding can be created by tokenizing text, building a vocabulary of unique words, and representing each document as a vector of word counts.

#4about 8 minutes

Comparing document vectors using cosine similarity

Cosine similarity measures the angle between two vectors to determine their semantic closeness by focusing on direction (meaning) rather than magnitude.

#5about 3 minutes

Understanding the limitations of a bag-of-words model

The simple bag-of-words model is sensitive to vocabulary, slow to scale, and fails to capture nuanced semantic meaning like word order or synonyms.

#6about 4 minutes

Using professional embedding models and vector databases

Production RAG systems use sophisticated embedding models and specialized vector databases for efficient, accurate, and scalable similarity search.

#7about 2 minutes

Exploring advanced RAG techniques and other applications

Beyond basic similarity search, techniques like ColBERT and knowledge graphs can improve retrieval accuracy, and vector search can power features like related content recommendations.

24 days ago

AI Software Engineer (m/f/d)

Sunhat
Köln, Germany

Remote

Senior

3 days ago

AI Engineer (m/w/d)

Riverty
Berlin, Germany

Intermediate

10 days ago

Lead Fullstack Engineer AI

Hubert Burda Media
München, Germany

Intermediate

Understanding Retrieval-Augmented Generation (RAG)

06:05 MIN

Understanding Retrieval-Augmented Generation (RAG)

Graphs and RAGs Everywhere... But What Are They? - Andreas Kollegger - Neo4j

Understanding retrieval-augmented generation (RAG)

15:49 MIN

Understanding retrieval-augmented generation (RAG)

Exploring LLMs across clouds

A deep dive into retrieval-augmented generation

23:59 MIN

A deep dive into retrieval-augmented generation

Lies, Damned Lies and Large Language Models

Introducing retrieval-augmented generation (RAG)

07:24 MIN

Introducing retrieval-augmented generation (RAG)

Martin O'Hanlon - Make LLMs make sense with GraphRAG

How RAG provides LLMs with up-to-date context

01:32 MIN

How RAG provides LLMs with up-to-date context

How to scrape modern websites to feed AI agents

Code walkthrough for building a RAG-based chatbot

39:05 MIN

Code walkthrough for building a RAG-based chatbot

Creating Industry ready solutions with LLM Models

Implementing the Retrieval-Augmented Generation (RAG) pattern

15:24 MIN

Implementing the Retrieval-Augmented Generation (RAG) pattern

Develop AI-powered Applications with OpenAI Embeddings and Azure Search

Augmenting ChatGPT with a long-term memory

37:37 MIN

Augmenting ChatGPT with a long-term memory

What comes after ChatGPT? Vector Databases - the Simple and powerful future of ML?

Featured Partners

Building Blocks of RAG: From Understanding to Implementation

Building Blocks of RAG: From Understanding to Implementation

Ashish Sharma

about 11 months ago • WeAreDevelopers LIVE

Carl Lapierre - Exploring Advanced Patterns in Retrieval-Augmented Generation

Carl Lapierre - Exploring Advanced Patterns in Retrieval-Augmented Generation

Carl Lapierre

about a year ago • World Congress 2024

Make it simple, using generative AI to accelerate learning

Make it simple, using generative AI to accelerate learning

Duan Lightfoot

about a year ago • World Congress 2024

Large Language Models ❤️ Knowledge Graphs

Large Language Models ❤️ Knowledge Graphs

Michael Hunger

about a year ago • World Congress 2024

Building Real-Time AI/ML Agents with Distributed Data using Apache Cassandra and Astra DB

Building Real-Time AI/ML Agents with Distributed Data using Apache Cassandra and Astra DB

Dieter Flick

about 2 years ago • World Congress 2023

Livecoding with AI

Livecoding with AI

Rainer Stropek

about a year ago • World Congress 2024

RAG like a hero with Docling

RAG like a hero with Docling

Alex Soto & Markus Eisele

about 2 months ago • World Congress 2025

Martin O'Hanlon - Make LLMs make sense with GraphRAG

Martin O'Hanlon - Make LLMs make sense with GraphRAG

Martin O'Hanlon

about 7 months ago • WeAreDevelopers LIVE

From learning to earning

Jobs that call for the skills explored in this talk.

Senior Data Scientist

3 months ago

Senior Data Scientist

SMG Swiss Marketplace Group
Belgrade, Serbia

Senior

Domain Architect Ricardo Platform (f/m/d) | 80-100% | Hybrid working model | Valbonne France

2 months ago

Domain Architect Ricardo Platform (f/m/d) | 80-100% | Hybrid working model | Valbonne France

SMG Swiss Marketplace Group
Canton de Valbonne, France

Senior

AI Agent Builder & Experimenter (Fullstack)

today

AI Agent Builder & Experimenter (Fullstack)

autonomous-teaming

Remote

API

React

Python

TypeScript

Software Engineer - KI & Retrieval (RAG/Azure)

today

Software Engineer - KI & Retrieval (RAG/Azure)

Jurafuchs

Remote

API

Azure

Python

Node.js

+4

AI Web Engineer

today

AI Web Engineer

CONVERSAL - BUGGENHOUDT

Remote

API

WordPress

AI Developer / Generative AI Engineer

today

AI Developer / Generative AI Engineer

Cognizant

API

ETL

REST

Azure

Neo4j

+4

AI Engineer / Developer - Generative AI

today

AI Engineer / Developer - Generative AI

Médiane Benelux

.NET

REST

Azure

DevOps

Python

+4

Conversational Designer:in - UX & AI

today

Conversational Designer:in - UX & AI

Ideabay GmbH

Remote

Figma

Data scientist large language models

today

Data scientist large language models

Radar VZW

C++

Python

Data analysis

Continuous Integration