Christian Weyer
Semantic AI: Why Embeddings Might Matter More Than LLMs
#1about 1 minute
Moving beyond hype with real-world generative AI
An internal company tool serves as a practical case study for applying language and embedding models to solve real business problems.
#2about 3 minutes
Integrating AI with existing enterprise data sources
The system combines API-based data from a third-party planning tool with document-based data from a Git-based knowledge base.
#3about 4 minutes
Building language-enabled universal interfaces for software
Instead of extending traditional GUIs, a universal interface allows users to interact with systems using natural language through platforms like Slack or voice.
#4about 3 minutes
Demonstrating a multi-system AI chat interface
A live demo shows how a single chat interface can query both a knowledge base and an employee availability system, providing source links to verify information.
#5about 3 minutes
Contrasting language models and embedding models
Language models are non-deterministic and generative, while embedding models are deterministic and create vector representations for comparison and retrieval.
#6about 4 minutes
Implementing retrieval-augmented generation for documents
The RAG pattern uses embeddings and a vector database to find relevant document chunks to provide as context for an LLM's answer.
#7about 4 minutes
Using LLMs for structured data and API calls
By providing a technical schema in the prompt, a language model can be forced to generate structured, machine-readable output for reliable API integration.
#8about 4 minutes
How semantic routing directs user queries
Semantic routing uses embeddings to classify a user's intent by finding the closest cluster of example questions, directing the request to the correct backend system.
#9about 1 minute
Why embeddings are the foundation of AI systems
Embeddings are crucial not just within LLMs but also for encoding meaning and enabling core architectural patterns like semantic routing and guarding.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
08:40 MIN
Understanding the role of embeddings and vector databases
Best practices: Building Enterprise Applications that leverage GenAI
44:41 MIN
Q&A on embedding calculation, ethics, and tooling
Develop AI-powered Applications with OpenAI Embeddings and Azure Search
09:55 MIN
Shifting from traditional code to AI-powered logic
WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA
30:39 MIN
Shifting from general LLMs to specialized models
ChatGPT vs Google: SEO in the Age of AI Search - Eric Enge
15:55 MIN
Shifting focus from standalone models to complete AI systems
Navigating the AI Revolution in Software Development
23:43 MIN
Key takeaways for building enterprise GenAI applications
Best practices: Building Enterprise Applications that leverage GenAI
00:04 MIN
The evolution of NLP from early models to modern LLMs
Harry Potter and the Elastic Semantic Search
00:15 MIN
Understanding the basic RAG pipeline and its limitations
Carl Lapierre - Exploring Advanced Patterns in Retrieval-Augmented Generation
Featured Partners
Related Videos
AI: Superhero or Supervillain? How and Why with Scott Hanselman
Scott Hanselman
Best practices: Building Enterprise Applications that leverage GenAI
Damir
GenAI Unpacked: Beyond Basic
Damir
Inside the Mind of an LLM
Emanuele Fabbiani
Exploring LLMs across clouds
Tomislav Tipurić
How AI Models Get Smarter
Ankit Patel
Bringing the power of AI to your application.
Krzysztof Cieślak
Unveiling the Magic: Scaling Large Language Models to Serve Millions
Patrick Koss
From learning to earning
Jobs that call for the skills explored in this talk.



Full-Stack AI Engineer - Fokus Backend, Generative AI & Cloud-Integration
Barmer
API
MySQL
NoSQL
React
Flask
+8


AI Enablement Engineer - LLM Integration & Technical Empowerment
KUEHNE + NAGEL
Intermediate
API
Python
Docker
Kubernetes
Continuous Integration
+1



AI Engineer / Machine Learning Engineer / KI-Entwickler - Schwerpunkt Cloud & MLOps
Agenda GmbH
Intermediate
API
Azure
Python
Docker
PyTorch
+9

Final Thesis Leveraging Artificial Intelligence for Work Content
RIB Deutschland GmbH