Key 3 - Multi-Step AI Workflows & Multimodal Models Explained
Author: Duke Center for Computational Thinking
Uploaded: 2025-10-31
Views: 77
Modern AI isn’t just text-to-text anymore. Learn how large language models chain together multiple steps and work across different modalities: text, images, video, and code. Discover how image models work through labeling, and how outputs from one model become inputs for another in sophisticated AI workflows.
Key concepts covered:
Chaining multiple AI operations together (web search → text → images)
How image generation models work through text labeling
Converting various inputs (PDFs, web pages, images) into text for LLMs
Multi-step reasoning where one output feeds the next operation
The probabilistic nature of image generation models
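The chaining idea in the list above (web search → text → images) can be sketched in a few lines of Python. This is only an illustration, not the workflow shown in the video: `web_search`, `summarize`, `generate_image`, and `run_chain` are hypothetical stand-ins that return canned strings instead of calling a real search engine or model. The `seed` parameter is also an assumption, used here to mimic the probabilistic nature of image generation.

```python
import random
from typing import List

# All functions below are hypothetical stubs, not real search or model APIs.

def web_search(query: str) -> List[str]:
    """Stub search step: returns canned text snippets for a query."""
    return [f"Snippet about {query} #1", f"Snippet about {query} #2"]

def summarize(snippets: List[str]) -> str:
    """Stub LLM step: condenses the snippets into one image prompt."""
    return "Illustration of: " + "; ".join(snippets)

def generate_image(prompt: str, seed: int) -> str:
    """Stub image step: the result is sampled, so it depends on the seed."""
    rng = random.Random(seed)
    variant = rng.randint(0, 9999)
    return f"<image variant={variant:04d} prompt={prompt!r}>"

def run_chain(query: str, seed: int) -> str:
    """Each step's output becomes the next step's input."""
    snippets = web_search(query)         # query -> text snippets
    prompt = summarize(snippets)         # snippets -> text prompt
    return generate_image(prompt, seed)  # prompt -> image placeholder

if __name__ == "__main__":
    print(run_chain("autumn forests", seed=1))
    print(run_chain("autumn forests", seed=2))
```

Running the chain twice with the same seed gives the same placeholder, while a different seed will usually give a different variant, which loosely mirrors how the same prompt can yield different images from a real model.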
Other videos in this series:
This is Key 3 of 8. After understanding model fundamentals (Key 1) and context tools (Key 2), explore Key 4 to discover why AI has 10x coding capabilities. Watch the complete playlist for full mastery.
Who this is for: Creative professionals, developers, and content creators wanting to harness AI’s full multi-step and multimodal potential for complex workflows.
#MultimodalAI #ImageGeneration #AIWorkflows #DALLE #StableDiffusion