Timo Zander
In the Dawn of the AI: Understanding and implementing AI-generated images
#1about 3 minutes
The rise of advanced AI text-to-image synthesis
AI models like OpenAI's DALL-E 2 can now generate photorealistic and culturally specific images directly from natural language prompts.
#2about 4 minutes
How generative adversarial networks (GANs) work
GANs use a two-player system where a generator creates fake images and a discriminator judges them against real ones, forcing both to improve.
#3about 2 minutes
The mathematical foundation of the GAN training process
The training process is a min-max game governed by a value function, where the discriminator maximizes accuracy and the generator minimizes it by creating better fakes.
#4about 3 minutes
Overcoming mode collapse for diverse outputs
Mode collapse, where the generator produces limited variety, can be fixed by introducing a similarity check that penalizes a lack of diversity in outputs.
#5about 3 minutes
Fixing non-convergence and vanishing gradient issues
Address training deadlocks with a two-timescale update rule and solve the vanishing gradient problem by replacing sigmoid activation functions with ReLU.
#6about 4 minutes
Using progressive GANs for high-resolution image generation
Progressive GANs achieve high-resolution results by starting with a low-resolution image and gradually fading in new layers to increase detail during training.
#7about 3 minutes
Creating controllable landscapes with GauGAN
GauGAN allows users to control image generation by providing a segmentation map for layout and a style image to set the overall mood and color palette.
#8about 8 minutes
The future and ethical challenges of AI image generation
The Q&A session explores the societal impact of AI-generated images, including deepfake detection, AI safety, AI-powered editing, and legal ownership.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
03:55 MIN
Understanding how generative AI models create content
The shadows that follow the AI generative models
13:57 MIN
The recent evolution of generative AI models
Enter the Brave New World of GenAI with Vector Search
00:09 MIN
Understanding the rapid evolution of generative AI tools
HR ROBO SAPIENS: Decoding AI Agents and Workflow Automation for Modern Recruitment
21:49 MIN
The serious threat of malicious and illegal AI use
The shadows that follow the AI generative models
01:45 MIN
The hype and promise of generative AI
AI'll Be Back: Generative AI in Image, Video, and Audio Production
56:11 MIN
Challenges and ethical concerns in generative AI
Enter the Brave New World of GenAI with Vector Search
02:34 MIN
From image recognition to modern generative AI
ChatGPT: Create a Presentation!
02:42 MIN
Overcoming the common challenges in generative AI adoption
From Traction to Production: Maturing your LLMOps step by step
Featured Partners
Related Videos
Deepfakes in Realtime - How Neural Networks Are Changing Our World
Thomas Endres & Martin Förtsch & Jonas Mayer
The AI Elections: How Technology Could Shape Public Sentiment
Martin Förtsch & Thomas Endres
Multimodal Generative AI Demystified
Ekaterina Sirazitdinova
Your imaginations is (no longer) the limit: how Generative AI empowers people to be creative
David Estevez
AI'll Be Back: Generative AI in Image, Video, and Audio Production
Fabian Pottbäcker, Thomas Endres & Martin Foertsch
GenAI after the Hype: Transforming Organizations with GenAI-based Agents
Alexander Birke & Silke Eggert
ChatGPT: Create a Presentation!
Markus Walker
AI: Superhero or Supervillain? How and Why with Scott Hanselman
Scott Hanselman
From learning to earning
Jobs that call for the skills explored in this talk.
Deep Learning Engineer, Visual Generative AINVIDIA
Nvidia
Bramley, United Kingdom
Senior
Python
Docker
PyTorch
TensorFlow
Microservices


