[V-JEPA] Beyond Pixels: V-JEPA 2 and the Shift to Action-Conditioned Video Prediction.

Автор: AI Podcast Series. Byte Goose AI.

Загружено: 2026-01-14

Просмотров: 142

Описание:

For years, the 'holy grail' of robotics has been a machine that can walk into a room it’s never seen, look at an object it’s never touched, and understand exactly how to move it.

Until recently, we tried to solve this by training robots on millions of specific examples—'pick up the red cup,' 'turn the blue knob.' But today, the paradigm is shifting from generative mimicry to predictive world models.

Today, we are unpacking V-JEPA 2. This isn't just another video model; it is a task-agnostic powerhouse that learns the 'physics of the world' simply by watching. By predicting the missing pieces of a video sequence through a sophisticated masking strategy, V-JEPA 2 builds an internal map of dynamics that allows for something incredible: zero-shot robot control.

In this episode, we’re breaking down the three pillars of this breakthrough:

Action-Conditioned Predictions: How the model simulates the outcomes of a robotic movement before the motor even turns.

Progressive-Resolution Training: The secret to scaling these models to high-res, long-form video without crashing your compute budget.

Preventing Collapse: A deep dive into the Energy-Based regularizers that keep the model’s internal representations from turning into useless noise.

From grasping to complex pick-and-place tasks, we’re looking at a future where robots don’t just follow scripts—they understand the world. Let’s dive in.

[V-JEPA] Beyond Pixels: V-JEPA 2 and the Shift to Action-Conditioned Video Prediction.

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

DeepSeek mHC: Использование гиперсоединений с ограничениями на многообразие для обучения LLM. Обу...

DeepSeek mHC: Использование гиперсоединений с ограничениями на многообразие для обучения LLM. Обу...

LLMs Tutorials

A Theory of the Mechanics of Information - Christopher Hazard

A Theory of the Mechanics of Information - Christopher Hazard

[VL-JEPA] LLM не будут заменены! Совместная архитектура прогнозирования на основе встраивания про...

[VL-JEPA] LLM не будут заменены! Совместная архитектура прогнозирования на основе встраивания про...

Scientists Just Discovered What Came Before the Big Bang—Here's What It Means

Scientists Just Discovered What Came Before the Big Bang—Here's What It Means

Означает ли V-JEPA конец эры LLM? Новое видение искусственного интеллекта от Яна Лекуна.

Означает ли V-JEPA конец эры LLM? Новое видение искусственного интеллекта от Яна Лекуна.

Diffusion Models Tutorial

Diffusion Models Tutorial

GRPO tutorial. Group Relative Policy Optimization.

GRPO tutorial. Group Relative Policy Optimization.

ФИЗИКИ не знают что такое ЭЛЕКТРИЧЕСКИЙ ТОК 💤Лекция для сна 💤 СОН ЗА 5 МИНУТ

ФИЗИКИ не знают что такое ЭЛЕКТРИЧЕСКИЙ ТОК 💤Лекция для сна 💤 СОН ЗА 5 МИНУТ

Что если мы - Марсиане?! / Эволюция Млечного Пути / Астрообзор #198

Что если мы - Марсиане?! / Эволюция Млечного Пути / Астрообзор #198

Why Does Fire BURN? Feynman's Answer Will DESTROY Your Reality

Why Does Fire BURN? Feynman's Answer Will DESTROY Your Reality

Energy Is Not a Thing — It’s the Universe’s Most Perfect Accounting Rule

Energy Is Not a Thing — It’s the Universe’s Most Perfect Accounting Rule

[LLM RAG] Генерация с расширенным поиском (RAG): Улучшение LLM для задач, требующих интенсивного ...

[LLM RAG] Генерация с расширенным поиском (RAG): Улучшение LLM для задач, требующих интенсивного ...

Если ВСЕЛЕННАЯ ДВИЖЕТСЯ то почему НЕБО НЕ МЕНЯЕТСЯ ? 💤Лекция для сна

Если ВСЕЛЕННАЯ ДВИЖЕТСЯ то почему НЕБО НЕ МЕНЯЕТСЯ ? 💤Лекция для сна

Mac Studio M3 Ultra Cluster: Суперкомпьютер для искусственного интеллекта в домашних условиях. EX...

Mac Studio M3 Ultra Cluster: Суперкомпьютер для искусственного интеллекта в домашних условиях. EX...

[GNNs] Graph Neural Networks vs. Graph Transformers: Navigating the Next Frontier of Graph Learning

[GNNs] Graph Neural Networks vs. Graph Transformers: Navigating the Next Frontier of Graph Learning

No One Understands What Elon Just Said About 2026

No One Understands What Elon Just Said About 2026

План Microsoft по снижению недовольства людей искусственным интеллектом и электричеством

План Microsoft по снижению недовольства людей искусственным интеллектом и электричеством

[mHC] DeepSeek. Гиперсвязи с ограничениями на многообразии (mHC): Сдвиг парадигмы в базовых прогр...

[mHC] DeepSeek. Гиперсвязи с ограничениями на многообразии (mHC): Сдвиг парадигмы в базовых прогр...

Guest Lecture 2: Philipp Henzler (Diffusion and Flow Models, Fall 2025, KAIST)

Guest Lecture 2: Philipp Henzler (Diffusion and Flow Models, Fall 2025, KAIST)