Популярное

Музыка Кино и Анимация Автомобили Животные Спорт Путешествия Игры Юмор

Интересные видео

2025 Сериалы Трейлеры Новости Как сделать Видеоуроки Diy своими руками

Топ запросов

смотреть а4 schoolboy runaway турецкий сериал смотреть мультфильмы эдисон
dTub
Скачать

SNAPP Seminar || Kevin Jamieson (University of Washington) || October 7, 2024

Автор: SNAPP Seminar

Загружено: 2024-10-08

Просмотров: 223

Описание:

Speaker: Kevin Jamieson (University of Washington) || October 7, 2024, Mon, 11:30am Eastern Time

Title: Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning

Abstract: In this talk, we explore the non-asymptotic sample complexity for the pure exploration problem in both contextual bandits and tabular reinforcement learning (RL), specifically focusing on identifying an ε-optimal policy from a given set of policies Π with high probability. In the bandit setting, prior work has demonstrated that it is possible to identify the best policy by focusing on estimating only the differences in behaviors between individual policies, rather than estimating each policy’s behavior independently, leading to significant improvements in sample efficiency. However, the best-known approaches for tabular RL fail to exploit this idea and instead estimate the behavior of each policy individually. We investigate whether this efficiency can be extended to RL by estimating only the differences in policy behaviors, and we present a nuanced answer. For contextual bandits, we show that such an approach is indeed sufficient. However, for tabular RL, we establish that it is not, revealing a key distinction between the two settings. Nevertheless, we propose a new approach inspired by this observation, showing that it is nearly sufficient to estimate behavior differences in RL when anchored by a reference policy. Our algorithm leverages this insight to provide the tightest known bound on the sample complexity of tabular RL, offering both theoretical advancements and practical implications for reinforcement learning research.

Speaker's Bio: Kevin Jamieson is an Associate Professor in the Paul G. Allen School of Computer Science & Engineering at the University of Washington. He received his B.S. in 2009 from the University of Washington under the advisement of Maya Gupta, his M.S. in 2010 from Columbia University under the advisement of Rui Castro, and his Ph.D. in 2015 from the University of Wisconsin - Madison under the advisement of Robert Nowak, all in electrical engineering. He returned to the University of Washington as faculty in 2017 after a postdoc in the AMP lab at the University of California, Berkeley working with Benjamin Recht. Jamieson's work has been recognized by an NSF CAREER award and Amazon Faculty Research award.

SNAPP Seminar || Kevin Jamieson (University of Washington) || October 7, 2024

Поделиться в:

Доступные форматы для скачивания:

Скачать видео mp4

  • Информация по загрузке:

Скачать аудио mp3

Похожие видео

SNAPP Seminar || Zeyu Zheng (University of California, Berkeley) || October 28, 2024

SNAPP Seminar || Zeyu Zheng (University of California, Berkeley) || October 28, 2024

UW Lecture: A New Era of Cosmic Discovery with the Rubin Observatory

UW Lecture: A New Era of Cosmic Discovery with the Rubin Observatory

SNAPP Seminar || Minshuo Chen (Northwestern University) || April 14, 2025

SNAPP Seminar || Minshuo Chen (Northwestern University) || April 14, 2025

4 Hours Chopin for Studying, Concentration & Relaxation

4 Hours Chopin for Studying, Concentration & Relaxation

The Bullitt Center: The Greenest Commercial Building in the World

The Bullitt Center: The Greenest Commercial Building in the World

SNAPP Seminar || Carri Chan (Columbia University) || April 28, 2025

SNAPP Seminar || Carri Chan (Columbia University) || April 28, 2025

Tunneling Toward a New State Route 99 Corridor

Tunneling Toward a New State Route 99 Corridor

What Really Happened 13.8 Billion Years Ago?

What Really Happened 13.8 Billion Years Ago?

Hubble at 25 & the James Webb Space Telescope: Dr. Amber Straughn

Hubble at 25 & the James Webb Space Telescope: Dr. Amber Straughn

SNAPP Seminar || Vianney Perchet (ENSAE) || November 3, 2025

SNAPP Seminar || Vianney Perchet (ENSAE) || November 3, 2025

Владимир Пастухов и Максим Курников | Интервью BILD

Владимир Пастухов и Максим Курников | Интервью BILD

Crows: Smarter Than You Think with UW Professor John Marzluff

Crows: Smarter Than You Think with UW Professor John Marzluff

CARTA: The Evolution of Human Biodiversity: Evan Eichler -Genome Structural Variation

CARTA: The Evolution of Human Biodiversity: Evan Eichler -Genome Structural Variation

Stein's Method for Queueing Approximations Lecture 1 (SNAPP Summer School 2025)

Stein's Method for Queueing Approximations Lecture 1 (SNAPP Summer School 2025)

The Search for Randomness with Persi Diaconis

The Search for Randomness with Persi Diaconis

SNAPP Seminar || Harsha Honnappa (Purdue University) || March 31, 2025

SNAPP Seminar || Harsha Honnappa (Purdue University) || March 31, 2025

Marketing Strategy Based on First Principles and Data Analytics - Chapter 3

Marketing Strategy Based on First Principles and Data Analytics - Chapter 3

Howard Frumkin: What is Planetary Health and Why Now

Howard Frumkin: What is Planetary Health and Why Now

UW ECE 2023-2024 Dean W. Lytle Electrical & Computer Engineering Endowed Lecture Series

UW ECE 2023-2024 Dean W. Lytle Electrical & Computer Engineering Endowed Lecture Series

Rocking the World of Physics

Rocking the World of Physics

© 2025 dtub. Все права защищены.



  • Контакты
  • О нас
  • Политика конфиденциальности



Контакты для правообладателей: [email protected]