What comes after ChatGPT? Vector Databases - the Simple and powerful future of ML?
What if your AI could remember everything? Vector databases give LLMs a long-term memory, eliminating hallucinations and making them smarter with your private data.
#1about 3 minutes
Understanding the limitations of large language models
Large language models like ChatGPT face challenges with token limits and incorporating private data, which restricts their use on large documents or custom knowledge bases.
#2about 3 minutes
Why vector databases are attracting major investment
Unlike relational or NoSQL databases, vector databases are designed to store and semantically search unstructured data, filling a critical gap in the data landscape.
#3about 4 minutes
The challenge of searching unstructured data
Manually tagging unstructured data like images and documents is inconsistent and subjective, making it an inefficient way to enable search.
#4about 5 minutes
How vector embeddings capture semantic meaning
Machine learning models convert unstructured data into numerical representations called embeddings, where semantically similar items are positioned closely in a high-dimensional space.
#5about 5 minutes
Visualizing relationships in a vector space
A demonstration with Google's Projector TensorFlow shows how words like "king" and "queen" are clustered together, visually representing their semantic proximity.
#6about 6 minutes
Performing fast similarity search with vectors
Vector databases use mathematical formulas to measure the distance between embeddings and employ indexing techniques like Approximate Nearest Neighbor (ANN) for high-speed search.
#7about 4 minutes
An overview of the vector database market
A look at popular vector databases like Pinecone, Weaviate, and Milvus, including their features, hosting models, and integrations with platforms like Hugging Face.
#8about 4 minutes
Building applications like intrusion and face detection
Vector databases can power real-world applications such as intrusion detection systems and face similarity matching without needing constant model retraining.
#9about 6 minutes
Augmenting ChatGPT with a long-term memory
The Retrieval-Augmented Generation (RAG) pattern uses a vector database to find relevant data chunks, providing LLMs with the right context to answer questions accurately.
#10about 16 minutes
Exploring more applications for vector search
Vector search enables a wide range of applications including recommendation systems, document deduplication, time-series analysis, and advanced product search.
Related jobs
Jobs that call for the skills explored in this talk.
With AIs wide open - WeAreDevelopers at All Things Open 2025Last week our VP of Developer Relations, Chris Heilmann, flew to Raleigh, North Carolina to present at All Things Open . An excellent event he had spoken at a few times in the past and this being the “Lucky 13” edition, he didn’t hesitate to come and...
Adrien Book
Top 5 ChatGPT Plugins for DevelopersThe last few weeks have been very interesting in the AI space. We saw the release of a new updated version of ChatGPT from GPT-3.5 to GPT-4. Within a couple of days, Google soft-launched their competitor AI chatbot, Bard (available in the US and UK)....
Benjamin Ruschin
Lessons for Vibe Coders and DevelopersIn late July 2025, the women-only dating-advice app Tea went viral for all the wrong reasons.
Marketed as a safe, private space to anonymously flag men for “red-flag” behavior, hackers attacked the app, accessing over 70,000 user-submitted images in...
From learning to earning
Jobs that call for the skills explored in this talk.