Phil Nash

Aug 20, 2024 • World Congress 2024

Build RAG from Scratch

You don't need complex tools to start with RAG. This session builds a surprisingly effective system from scratch using basic vectorization and cosine similarity.

#1 • about 3 minutes

Why large language models need retrieval augmented generation

Large language models have knowledge cutoffs and lack access to private data, a problem solved by providing relevant context at query time using RAG.
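As a rough illustration of "providing relevant context at query time", the sketch below assembles a prompt from retrieved documents and a user question. The function name `build_prompt`, the prompt wording, and the sample document are assumptions made for illustration; they are not taken from the talk.

```python
def build_prompt(question: str, context_docs: list[str]) -> str:
    """Combine retrieved documents and the user's question into one prompt."""
    context = "\n".join(f"- {doc}" for doc in context_docs)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )


# Hypothetical private data that a model could not have seen during training.
docs = ["The office is closed on 27 December."]
prompt = build_prompt("When is the office closed?", docs)
print(prompt)
```

The resulting string would then be sent to whichever language model is in use; that call is left out here because it depends on the provider.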

#2 • about 1 minute

How similarity search and vector embeddings power RAG

RAG relies on similarity search rather than keyword search. Similarity search captures meaning by converting text into numerical representations called vector embeddings.

#3 • about 6 minutes

Building a simple bag-of-words vectorizer from scratch

A basic vector embedding can be created by tokenizing text, building a vocabulary of unique words, and representing each document as a vector of word counts.
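The three steps named above (tokenize, build a vocabulary, count words) can be sketched as follows. This is a minimal version under stated assumptions: the function names, the regular expression used for tokenizing, and the sample documents are illustrative choices, not the code shown in the talk.

```python
import re


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def build_vocabulary(documents: list[str]) -> list[str]:
    """Collect every unique token across all documents, in sorted order."""
    return sorted({token for doc in documents for token in tokenize(doc)})


def vectorize(text: str, vocabulary: list[str]) -> list[int]:
    """Represent the text as one word count per vocabulary entry."""
    tokens = tokenize(text)
    return [tokens.count(word) for word in vocabulary]


documents = ["the cat sat on the mat", "the dog chased the cat"]
vocabulary = build_vocabulary(documents)
vectors = [vectorize(doc, vocabulary) for doc in documents]

print(vocabulary)  # ['cat', 'chased', 'dog', 'mat', 'on', 'sat', 'the']
print(vectors[0])  # [1, 0, 0, 1, 1, 1, 2]
print(vectors[1])  # [1, 1, 1, 0, 0, 0, 2]
```

Every vector has the same length as the vocabulary, which is what makes two documents comparable position by position.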

#4 • about 8 minutes

Comparing document vectors using cosine similarity

Cosine similarity is the cosine of the angle between two vectors. It judges how close two documents are by the direction of their vectors (meaning) rather than their magnitude (length).
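A minimal sketch of the calculation: the dot product of the two vectors divided by the product of their lengths. The function name and the choice to return 0.0 for an all-zero vector are assumptions made for this example.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Return dot(a, b) / (|a| * |b|), or 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # assumption: treat an empty document as matching nothing
    return dot / (norm_a * norm_b)


# Same direction, different magnitude: similarity is (approximately) 1.
same_direction = cosine_similarity([1, 2, 0], [2, 4, 0])

# No shared non-zero dimensions: similarity is 0.
orthogonal = cosine_similarity([1, 0], [0, 1])

print(same_direction, orthogonal)
```

Doubling every word count in a document leaves its score unchanged, which is why a long document and a short one on the same topic can still rank as close.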

#5 • about 3 minutes

Understanding the limitations of a bag-of-words model

The simple bag-of-words model is sensitive to vocabulary, scales poorly, and misses nuances of meaning because it ignores word order and treats synonyms as unrelated words.
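Two of these limitations are easy to demonstrate. The sketch below uses `collections.Counter` as a stand-in bag of words; the helper name and the sample phrases are made up for illustration.

```python
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Count the words in a text, ignoring case and order."""
    return Counter(text.lower().split())


# Word order is lost: opposite meanings produce identical bags.
order_lost = bag_of_words("dog bites man") == bag_of_words("man bites dog")

# Synonyms are unrelated: these phrases mean the same but share no words,
# so any similarity score built on word counts would be zero.
shared_words = set(bag_of_words("fast car")) & set(bag_of_words("quick automobile"))

print(order_lost)    # True
print(shared_words)  # set()
```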

#6 • about 4 minutes

Using professional embedding models and vector databases

Production RAG systems use sophisticated embedding models and specialized vector databases for efficient, accurate, and scalable similarity search.
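To show the role a vector database plays, here is a deliberately tiny in-memory stand-in. Everything in it is an assumption for illustration: the class `TinyVectorStore` is hypothetical, the three-dimensional embeddings are hand-written rather than produced by a real model, and the search is brute force, whereas real vector databases typically rely on approximate nearest-neighbour indexes to stay fast at scale.

```python
import math
from dataclasses import dataclass, field


def cosine_similarity(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


@dataclass
class TinyVectorStore:
    """Hypothetical stand-in for a vector database: stores (text, embedding) pairs."""

    items: list[tuple[str, list[float]]] = field(default_factory=list)

    def add(self, text: str, embedding: list[float]) -> None:
        self.items.append((text, embedding))

    def search(self, query_embedding: list[float], k: int = 2) -> list[tuple[float, str]]:
        """Score every stored item against the query and return the top k."""
        scored = [
            (cosine_similarity(query_embedding, embedding), text)
            for text, embedding in self.items
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return scored[:k]


store = TinyVectorStore()
# Toy embeddings; a real system would get these from an embedding model.
store.add("cats", [0.9, 0.1, 0.0])
store.add("dogs", [0.8, 0.3, 0.0])
store.add("tax law", [0.0, 0.1, 0.9])

results = store.search([1.0, 0.0, 0.0], k=2)
print([text for _, text in results])  # ['cats', 'dogs']
```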

#7 • about 2 minutes

Exploring advanced RAG techniques and other applications

Beyond basic similarity search, techniques like ColBERT and knowledge graphs can improve retrieval accuracy, and vector search can power features like related content recommendations.
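ColBERT scores a document using one embedding per token rather than one per document: each query token is matched to its most similar document token, and those maxima are summed. The sketch below shows only that scoring step with hand-written 2-D token vectors; the function name and all the numbers are assumptions, and a real system would obtain token embeddings from a trained model.

```python
def max_sim_score(query_tokens: list[list[float]], doc_tokens: list[list[float]]) -> float:
    """Sum, over query tokens, of the best dot product with any document token."""
    total = 0.0
    for q in query_tokens:
        best = max(sum(x * y for x, y in zip(q, d)) for d in doc_tokens)
        total += best
    return total


# Toy unit-length token embeddings (illustrative values only).
query = [[1.0, 0.0], [0.0, 1.0]]
doc_a = [[1.0, 0.0], [0.0, 1.0], [0.6, 0.8]]  # contains a close match for each query token
doc_b = [[0.6, 0.8]]                          # one token that partly matches both

score_a = max_sim_score(query, doc_a)
score_b = max_sim_score(query, doc_b)
print(score_a > score_b)  # True
```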

Building Blocks of RAG: From Understanding to Implementation

Ashish Sharma

about 10 months ago • WeAreDevelopers LIVE

Carl Lapierre - Exploring Advanced Patterns in Retrieval-Augmented Generation

Carl Lapierre

about 11 months ago • World Congress 2024

Make it simple, using generative AI to accelerate learning

Duan Lightfoot

about a year ago • World Congress 2024

Large Language Models ❤️ Knowledge Graphs

Michael Hunger

about a year ago • World Congress 2024

Building Real-Time AI/ML Agents with Distributed Data using Apache Cassandra and Astra DB

Dieter Flick

about 2 years ago • World Congress 2023

Livecoding with AI

Rainer Stropek

about a year ago • World Congress 2024

RAG like a hero with Docling

Alex Soto, Markus Eisele

about 2 months ago • World Congress 2025

Martin O'Hanlon - Make LLMs make sense with GraphRAG

Martin O'Hanlon

about 6 months ago • WeAreDevelopers LIVE
