Rainer Stropek
Develop AI-powered Applications with OpenAI Embeddings and Azure Search
#1about 3 minutes
Understanding embedding vectors as numerical representations
Embedding vectors convert complex concepts like text or personality into multi-dimensional numerical arrays, enabling comparison and clustering.
#2about 7 minutes
Working with the OpenAI embeddings API and cosine similarity
The OpenAI API provides an endpoint to generate a 1,536-dimensional vector for a given text, and vector similarity can be efficiently calculated using a dot product.
#3about 5 minutes
Building custom applications with the OpenAI chat API
The chat completions API allows developers to build custom applications by sending a model the entire chat history, including system prompts and user messages.
#4about 3 minutes
Implementing the Retrieval-Augmented Generation (RAG) pattern
The RAG pattern enhances LLM responses by first retrieving relevant facts from a private knowledge base using vector search and then injecting that context into the prompt.
#5about 4 minutes
Demo overview of building a school wiki assistant
A practical demonstration shows how to build a Q&A assistant for a school's private wiki using a crawler, an indexer, and a query application.
#6about 8 minutes
Step 1: Crawling and pre-processing the source data
The first step in the RAG pipeline involves building a custom crawler to extract, clean, and convert source data into a usable format like Markdown.
#7about 6 minutes
Step 2: Indexing embeddings into a vector database
An indexer application iterates through pre-processed documents, calculates their embeddings via the OpenAI API, and stores them in Azure Cognitive Search for fast retrieval.
#8about 5 minutes
Step 3: Querying the system using the RAG pattern
The query application generates an embedding for the user's question, performs a vector search to find relevant documents, and injects them into a system prompt for the LLM.
#9about 5 minutes
Live demonstration of the wiki Q&A assistant
The command-line assistant successfully answers specific questions about school policies by retrieving information from the wiki, even handling multi-language queries.
#10about 13 minutes
Q&A on embedding calculation, ethics, and tooling
The speaker answers audience questions about how embeddings are calculated, ensuring answer correctness, responsible AI development, and recommended developer tools.
Related jobs
Jobs that call for the skills explored in this talk.
Picnic Technologies B.V.
Amsterdam, Netherlands
Intermediate
Senior
Python
Structured Query Language (SQL)
+1
ROSEN Technology and Research Center GmbH
Osnabrück, Germany
Senior
TypeScript
React
+3
Matching moments
04:57 MIN
Increasing the value of talk recordings post-event
Cat Herding with Lions and Tigers - Christian Heilmann
01:32 MIN
Organizing a developer conference for 15,000 attendees
Cat Herding with Lions and Tigers - Christian Heilmann
03:17 MIN
Selecting strategic partners and essential event tools
Cat Herding with Lions and Tigers - Christian Heilmann
03:15 MIN
The future of recruiting beyond talent acquisition
What 2025 Taught Us: A Year-End Special with Hung Lee
03:28 MIN
Why corporate AI adoption lags behind the hype
What 2025 Taught Us: A Year-End Special with Hung Lee
04:27 MIN
Moving beyond headcount to solve business problems
What 2025 Taught Us: A Year-End Special with Hung Lee
02:44 MIN
Rapid-fire thoughts on the future of work
What 2025 Taught Us: A Year-End Special with Hung Lee
03:48 MIN
Automating formal processes risks losing informal human value
What 2025 Taught Us: A Year-End Special with Hung Lee
Featured Partners
Related Videos
Inside the AI Revolution: How Microsoft is Empowering the World to Achieve More
Simi Olabisi
Best practices: Building Enterprise Applications that leverage GenAI
Damir
GenAI Unpacked: Beyond Basic
Damir
WeAreDevelopers LIVE - Vector Similarity Search Patterns for Efficiency and more
Chris Heilmann, Daniel Cranney, Raphael De Lio & Developer Advocate at Redis
From Syntax to Singularity: AI’s Impact on Developer Roles
Anna Fritsch-Weninger
Building Blocks of RAG: From Understanding to Implementation
Ashish Sharma
Azure AI Foundry for Developers: Open Tools, Scalable Agents, Real Impact
Oliver Will
Enter the Brave New World of GenAI with Vector Search
Mary Grygleski
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Agenda GmbH
Raubling, Germany
Remote
Intermediate
API
Azure
Python
Docker
+10

Agenda GmbH
Rosenheim, Germany
Intermediate
API
Azure
Python
Docker
PyTorch
+9




FRG Technology Consulting
Intermediate
Azure
Python
Machine Learning

Imec
Azure
Python
PyTorch
TensorFlow
Computer Vision
+1

Mixsoft
Remote
API
Python
PyTorch
TensorFlow
+1

autonomous-teaming
München, Germany
Remote
C++
GIT
Linux
Python
+1