Sam Witteveen
Google Gemini: Open Source and Deep Thinking Models - Sam Witteveen
#1about 3 minutes
Navigating the current AI hype and research secrecy
The term "AI" is often used for marketing, while the competitive landscape has made research labs more secretive about their work.
#2about 3 minutes
Understanding Google's open weights Gemma models
Gemma models are "open weights," not fully open source, meaning developers can use the model weights for commercial projects but don't have the training data or code.
#3about 2 minutes
The training process of large language models
LLMs are trained in stages, starting with pre-training to predict the next token, followed by post-training and reinforcement learning to align with human instructions.
#4about 2 minutes
Comparing proprietary Gemini and open Gemma models
Gemini models are Google's proprietary, cloud-based offerings, while Gemma models are open-weight versions that can be run on-premise or on-device.
#5about 3 minutes
How AI models are becoming smaller and more efficient
Models are made smaller and more powerful through training on vast amounts of data and using techniques like distillation to transfer knowledge from larger models.
#6about 7 minutes
The rise of multilingual and multimodal AI
Modern models like Gemma 3 are trained on over 140 languages, and multimodal models like Gemini can process text, audio, video, and images simultaneously.
#7about 4 minutes
Improving text in AI-generated images and videos
Recent models are finally getting better at rendering accurate text in images, while video generation models can create complex, realistic scenes from prompts.
#8about 3 minutes
AI tools for advanced content creation and editing
AI-powered tools can now edit video by manipulating the transcript and even insert new words in a cloned voice, simplifying the post-production process.
#9about 4 minutes
Using large language models as a learning tool
LLMs serve as powerful educational tools by explaining complex concepts at different levels, acting as a personal tutor for learning new technical skills.
#10about 6 minutes
How deep reasoning models 'think' before answering
Reasoning models improve accuracy by generating an internal monologue to break down a problem and explore possibilities before providing a final answer.
#11about 10 minutes
The evolution of AI-powered coding assistants
AI coding tools are evolving into agents that can write code, run tests, and fix errors, shifting the developer's role toward architectural and conversational guidance.
#12about 2 minutes
How to get started with Google's Gemma models
Developers can start experimenting with Gemma models through Google's AI Studio, Hugging Face, or by running them locally with tools like Ollama and LM Studio.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
03:37 MIN
Understanding Google Gemini models and capabilities
Exploring Google Gemini and Generative AI
05:49 MIN
Understanding the roles of Gemini and Gemma models
Google Gemma and Open Source AI Models - Clement Farabet
31:13 MIN
Running on-device AI in the browser with Gemini Nano
Exploring Google Gemini and Generative AI
30:13 MIN
Using open source Gemma for local AI processing
What’s New with Google Gemini?
02:50 MIN
Understanding the hackathon's open tech stack and rules
Coffee With Developers Michael Koitz
28:12 MIN
Exploring practical use cases and model limitations
Exploring Google Gemini and Generative AI
18:41 MIN
Building agents with Google Gemini and open source tools
Beyond Chatbots: How to build Agentic AI systems
18:03 MIN
GenAI applications and emerging professional roles
Enter the Brave New World of GenAI with Vector Search
Featured Partners
Related Videos
What’s New with Google Gemini?
Logan Kilpatrick
Google Gemma and Open Source AI Models - Clement Farabet
Exploring Google Gemini and Generative AI
Developer Productivity Using AI Tools and Services - Ryan J Salva
Ryan J Salva
Exploring the Future of Web AI with Google
Thomas Steiner
How to Avoid LLM Pitfalls - Mete Atamel and Guillaume Laforge
Meta Atamel & Guillaume Laforge
Coffee with Developers - Maria Apazoglou
Maria Apazoglou
Engineering Mindset in the Age of AI - Gunnar Grosch, AWS
Gunnar Grosch
From learning to earning
Jobs that call for the skills explored in this talk.

Lead Fullstack Engineer AI
Hubert Burda Media
München, Germany
€80-95K
Intermediate
React
Python
Vue.js
Langchain
+1






AIML -Machine Learning Research, DMLI
Apple
Python
PyTorch
TensorFlow
Machine Learning
Natural Language Processing

