🎙 Interactive Latent Diffusion: Steering Text-to-Image Models
Author: Data Sanity Talks
Uploaded: 2025-06-27
Views: 45
🚀 Data Sanity Talks Belgrade, June 2-3
Speaker: Nick Knizev, Co-Founder @ Wizium.ai, Ex-Meta
What if you could collaborate with an AI to generate images instead of just prompting it and hoping for the best? In this talk, Nick Knizev shares his Best Paper–nominated work on Interactive Latent Diffusion Models (IELDM), a new approach that lets users steer text-to-image generation in real time. Users guide generation by selecting preferred outputs and interacting with specific image regions, which helps the model learn and adapt to their intent. These targeted refinements, together with image recombination, reduce trial and error and show strong performance even on complex prompts that typically challenge diffusion models.
🚀 Learn more: https://datasanity.dev/