Sherry Yang - Learning World Models and Agents for High-Cost Environments

Автор: uclanlp-plus

Загружено: 2025-12-18

Просмотров: 69

Описание:

Talk Title: Learning World Models and Agents for High-Cost Environments

Abstract: While neural networks have achieved superhuman performance in domains with low-cost simulations—from AlphaGo to LLMs—their application to the physical world is bottlenecked by a fundamental challenge: high-cost interactions. In fields like robotics, ML engineering, and the natural sciences, every action or experiment is expensive and time-consuming. This talk outlines strategies for building intelligent agents that learn efficiently despite these real-world constraints. We first address the physical world by showing how learned world models can serve as high-fidelity simulators for robotics, enabling extensive policy refinement before deployment on costly hardware. We then turn to complex engineering domains, where actions like running an ML program incur significant time delays, and discuss adaptations to reinforcement learning to make it robust for these long action settings. Finally, we show how compositional generative models can navigate the vast hypothesis spaces in science, intelligently proposing experiments to accelerate the pace of discovery.

To checkout other talks in our full NLP Seminar Series, please visit: • UCLA NLP Seminar Series

Sherry Yang - Learning World Models and Agents for High-Cost Environments

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

Arman Cohan - Evaluating and Understanding LLMs: From Scientific Reasoning to Alignment as Judges

Arman Cohan - Evaluating and Understanding LLMs: From Scientific Reasoning to Alignment as Judges

Aviral Kumar - The Importance of Exploration for Test-Time Scaling

Aviral Kumar - The Importance of Exploration for Test-Time Scaling

Natasha Jaques - Social Reinforcement Learning for pluralistic alignment and human-AI interaction

Natasha Jaques - Social Reinforcement Learning for pluralistic alignment and human-AI interaction

4 Hours Chopin for Studying, Concentration & Relaxation

4 Hours Chopin for Studying, Concentration & Relaxation

YaC 2025 AI Edition

YaC 2025 AI Edition

LLM и GPT - как работают большие языковые модели? Визуальное введение в трансформеры

LLM и GPT - как работают большие языковые модели? Визуальное введение в трансформеры

5 Types of AI Agents: Autonomous Functions & Real-World Applications

5 Types of AI Agents: Autonomous Functions & Real-World Applications

Как Сделать Настольный ЭЛЕКТРОЭРОЗИОННЫЙ Станок?

Как Сделать Настольный ЭЛЕКТРОЭРОЗИОННЫЙ Станок?

Но что такое нейронная сеть? | Глава 1. Глубокое обучение

Но что такое нейронная сеть? | Глава 1. Глубокое обучение

Zhe Gan - How to Build Your Multimodal LLMs: From Pre-training to Post-training and Agents

Zhe Gan - How to Build Your Multimodal LLMs: From Pre-training to Post-training and Agents

There Is Something Faster Than Light

There Is Something Faster Than Light

Stanford CS234: Reinforcement Learning | Winter 2019 | Lecture 1 - Introduction - Emma Brunskill

Stanford CS234: Reinforcement Learning | Winter 2019 | Lecture 1 - Introduction - Emma Brunskill

The Essential Main Ideas of Neural Networks

The Essential Main Ideas of Neural Networks

Sherry Yang - Learning World Models and Physical Agents

Sherry Yang - Learning World Models and Physical Agents

What are AI Agents?

What are AI Agents?

MIT 6.S191: Reinforcement Learning

MIT 6.S191: Reinforcement Learning

Reinforcement Learning: Machine Learning Meets Control Theory

Reinforcement Learning: Machine Learning Meets Control Theory

Don't learn AI Agents without Learning these Fundamentals

Don't learn AI Agents without Learning these Fundamentals

MIT 6.S091: Introduction to Deep Reinforcement Learning (Deep RL)

MIT 6.S091: Introduction to Deep Reinforcement Learning (Deep RL)

💥7 МИНУТ НАЗАД! Серия убийств ТОП ГЕНЕРАЛОВ РФ! Спецслужбы БЕССИЛЬНЫ, у Z-ников ИСТЕРИКА - НАКИ

💥7 МИНУТ НАЗАД! Серия убийств ТОП ГЕНЕРАЛОВ РФ! Спецслужбы БЕССИЛЬНЫ, у Z-ников ИСТЕРИКА - НАКИ