Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 7: Offline RL

Автор: Stanford Online

Загружено: 2025-12-08

Просмотров: 1787

Описание:

View course details: https://online.stanford.edu/courses/x...

April 23, 2025
This lecture covers:
• Key challenges arising in offline reinforcement learning
• Two approaches for offline RL (& why they work!)
• How offline RL can improve over imitation learning

To learn more about enrolling in the graduate course, visit: https://online.stanford.edu/courses/c...

To follow along with the course schedule and syllabus, visit:
https://cs224r.stanford.edu/

Chelsea Finn
Assistant Professor in Computer Science and Electrical Engineering at Stanford University and co-founder of Pi.

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 7: Offline RL

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 8: Reward Learning

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 8: Reward Learning

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 15: Hierarchical RL and IL

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 15: Hierarchical RL and IL

Causal Mechanistic Interpretability (Stanford lecture 1) - Atticus Geiger

Causal Mechanistic Interpretability (Stanford lecture 1) - Atticus Geiger

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 1: Class Intro

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 1: Class Intro

LLM fine-tuning или ОБУЧЕНИЕ малой модели? Мы проверили!

LLM fine-tuning или ОБУЧЕНИЕ малой модели? Мы проверили!

DeepMind x UCL | Introduction to Reinforcement Learning 2015

DeepMind x UCL | Introduction to Reinforcement Learning 2015

ЛЕКЦИЯ ПРО НАДЁЖНЫЕ ШИФРЫ НА КОНФЕРЕНЦИИ БАЗОВЫХ ШКОЛ РАН В ТРОИЦКЕ

ЛЕКЦИЯ ПРО НАДЁЖНЫЕ ШИФРЫ НА КОНФЕРЕНЦИИ БАЗОВЫХ ШКОЛ РАН В ТРОИЦКЕ

[NeurIPS Best Paper] 1000 Layer Networks for Self-Supervised RL — Kevin Wang et al, Princeton

[NeurIPS Best Paper] 1000 Layer Networks for Self-Supervised RL — Kevin Wang et al, Princeton

Почему Питер Шольце — математик, каких бывает раз в поколение?

Почему Питер Шольце — математик, каких бывает раз в поколение?

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 6: Q-Learning

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 6: Q-Learning

🧪🧪🧪🧪Как увидеть гиперпространство (4-е измерение)

🧪🧪🧪🧪Как увидеть гиперпространство (4-е измерение)

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 16: RL for Robots

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 16: RL for Robots

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 18: Frontiers

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 18: Frontiers

Самая сложная модель из тех, что мы реально понимаем

Самая сложная модель из тех, что мы реально понимаем

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 5: Off-Policy Actor Critic

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 5: Off-Policy Actor Critic

Стэнфорд CS224R Глубокое обучение с подкреплением | Весна 2025 г. | Лекция 17: Развитие интеллект...

Стэнфорд CS224R Глубокое обучение с подкреплением | Весна 2025 г. | Лекция 17: Развитие интеллект...

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 11: Model-Based RL

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 11: Model-Based RL

Richard Sutton – Father of RL thinks LLMs are a dead end

Richard Sutton – Father of RL thinks LLMs are a dead end

Математическая тревожность, нейросети, задачи тысячелетия / Андрей Коняев

Математическая тревожность, нейросети, задачи тысячелетия / Андрей Коняев

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 4: Actor-Critic Methods

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 4: Actor-Critic Methods