Chris Heilmann, Daniel Cranney, Raphael De Lio & Developer Advocate at Redis
WeAreDevelopers LIVE - Vector Similarity Search Patterns for Efficiency and more
#1about 8 minutes
Getting hired through open source and passion projects
Hear how contributing to open source and sharing your work publicly can lead directly to job opportunities in developer advocacy.
#2about 5 minutes
How critical analysis can accelerate your career
Discover how publicly analyzing and improving upon existing technologies can make you a highly visible and attractive candidate for top companies.
#3about 3 minutes
The hidden costs of large LLM context windows
Understand why simply using larger context windows in models like GPT-5 is not a scalable or cost-effective solution for production applications.
#4about 3 minutes
A quick primer on vectors and vector search
A brief explanation of how text is converted into numerical vectors to represent its semantic meaning, enabling similarity searches.
#5about 9 minutes
Using semantic classification to categorize text
Learn how to use a vector database with reference examples to classify text, avoiding costly LLM calls for simple categorization tasks.
#6about 5 minutes
Implementing semantic routing for tool calling and guardrails
Discover how to use semantic routing to direct user prompts to the correct function or to block inappropriate topics without involving an LLM.
#7about 6 minutes
Reducing latency and cost with semantic caching
Implement semantic caching to store and retrieve answers for semantically similar user questions, drastically reducing redundant LLM calls and improving response time.
#8about 6 minutes
Optimizing accuracy for classification and tool calling
Explore techniques like self-improvement, hybrid fallbacks, and prompt chunking to fine-tune and improve the accuracy of your semantic patterns.
#9about 4 minutes
Advanced caching with specialized embedding models
Learn how to avoid common caching pitfalls, such as misinterpreting negation, by using specialized embedding models trained for semantic caching.
#10about 16 minutes
Q&A on data freshness, persistence, and management
The discussion covers practical considerations like preventing stale cache data with TTL, managing data ownership, and how Redis handles persistence.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
26:04 MIN
Exploring advanced RAG techniques and other applications
Build RAG from Scratch
43:14 MIN
Practical use cases for vector embeddings
Enter the Brave New World of GenAI with Vector Search
09:42 MIN
How to choose the right tools for your AI application
Building AI Applications with LangChain and Node.js
43:50 MIN
Exploring more applications for vector search
What comes after ChatGPT? Vector Databases - the Simple and powerful future of ML?
04:35 MIN
Exploring common AI application patterns
Building AI Applications with LangChain and Node.js
20:54 MIN
Live code demo of various AI application patterns
Building AI Applications with LangChain and Node.js
09:55 MIN
Shifting from traditional code to AI-powered logic
WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA
14:10 MIN
Leveraging open software and AI for code development
The Future of Computing: AI Technologies in the Exascale Era
Featured Partners
Related Videos
Reducing LLM Calls with Vector Search Patterns - Raphael De Lio (Redis)
Develop AI-powered Applications with OpenAI Embeddings and Azure Search
Rainer Stropek
What comes after ChatGPT? Vector Databases - the Simple and powerful future of ML?
Erik Bamberg
Enter the Brave New World of GenAI with Vector Search
Mary Grygleski
How to Avoid LLM Pitfalls - Mete Atamel and Guillaume Laforge
Meta Atamel & Guillaume Laforge
WeAreDevelopers LIVE – AI vs the Web & AI in Browsers
Chris Heilmann, Daniel Cranney & Raymond Camden
Accelerating GenAI Development: Harnessing Astra DB Vector Store and Langflow for LLM-Powered Apps
Dieter Flick & Michel de Ru
Building Real-Time AI/ML Agents with Distributed Data using Apache Cassandra and Astra DB
Dieter Flick
From learning to earning
Jobs that call for the skills explored in this talk.


Lead Fullstack Engineer AI
Hubert Burda Media
München, Germany
€80-95K
Intermediate
React
Python
Vue.js
Langchain
+1

![Senior Software Engineer [TypeScript] (Prisma Postgres)](https://wearedevelopers.imgix.net/company/283ba9dbbab3649de02b9b49e6284fd9/cover/oKWz2s90Z218LE8pFthP.png?w=400&ar=3.55&fit=crop&crop=entropy&auto=compress,format)
Senior Software Engineer [TypeScript] (Prisma Postgres)
Prisma
Remote
Senior
Node.js
TypeScript
PostgreSQL

Machine Learning Engineer
Picnic Technologies B.V.
Amsterdam, Netherlands
Intermediate
Senior
Python
Machine Learning
Structured Query Language (SQL)


Team Lead and Senior Software Engineer with focus on AI
Dynatrace
Linz, Austria
Senior
Java
Team Leadership


Machine Learning Algorithm/SW Optimization Engineer
Leuven MindGate
Python
PyTorch
TensorFlow
Machine Learning