LEMAS Seminar by Professor Rasoul Etesami (UIUC) on Learning in Stochastic Games

Автор: Berkeley-HopkinsLEMAS

Загружено: 2026-01-07

Просмотров: 4

Описание:

Title: Learning Stationary Nash Equilibrium Policies in n-Player Stochastic Games with Independent Chains

Abstract: Motivated by applications such as bandwidth allocation in wireless communication networks and energy management in smart grids, we consider a subclass of n-player stochastic games, in which players have their own internal state spaces while they are coupled through their payoff functions. It is assumed that players' internal chains are driven by independent transition probabilities. Moreover, players receive only realizations of their payoffs and cannot observe each other's states or actions. For this class of stochastic games, we first show that finding a stationary Nash equilibrium (NE) policy is intractable without additional assumptions on the reward functions. Nevertheless, for general reward functions, we develop polynomial-time learning algorithms based on dual averaging that converge to the set of ε-NE policies in terms of the averaged Nikaido-Isoda distance, either almost surely or with high probability. We further extend our results to the settings with unknown transition probabilities. In particular, under additional assumptions on the reward functions or the equilibrium structure, we show that our algorithms indeed converge to an ε-NE policy with high probability in polynomial time. Finally, we demonstrate the effectiveness of the proposed algorithms through numerical experiments on energy management in smart grids.

Bio: Rasoul Etesami is an Associate Professor in the Department of Industrial and Systems Engineering at the University of Illinois Urbana-Champaign (UIUC), where he is also affiliated with the Department of Electrical and Computer Engineering and the Coordinated Science Laboratory. Prior to joining the faculty at UIUC, he was a Postdoctoral Research Fellow in the Department of Electrical Engineering at Princeton University and WINLAB. He received his Ph.D. in Electrical and Computer Engineering from UIUC in 2015. His research interests include the analysis of complex socioeconomic and decision-making systems using tools from control theory, game theory, optimization, and learning theory. He is the recipient of the Best CSL Ph.D. Thesis Award at UIUC in 2016, the Springer Outstanding Ph.D. Thesis Award in 2017, the NSF CAREER Award in 2020, the US Air Force Young Investigator Award in 2023, the SIAM Journal on Control and Optimization Best Paper Award in 2025, and multiple best teaching and reviewer awards. He has served as General Chair for conferences such as the Annual Allerton Conference and the C3.ai DTI Workshop, and as an Associate Editor for journals including IET Smart Grid and Dynamic Games and Applications

LEMAS Seminar by Professor Rasoul Etesami (UIUC) on Learning in Stochastic Games

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

Почему простые числа образуют эти спирали? | Теорема Дирихле и пи-аппроксимации

Почему простые числа образуют эти спирали? | Теорема Дирихле и пи-аппроксимации

Разведчик о том, как использовать людей

Разведчик о том, как использовать людей

Crawfordsville Pelican Pod with Indiana AID

Crawfordsville Pelican Pod with Indiana AID

LEMAS Seminar by Professor Jorge Poveda (UCSD) on Deception and Incentives in Multi-Agent Learning

LEMAS Seminar by Professor Jorge Poveda (UCSD) on Deception and Incentives in Multi-Agent Learning

КУСТО УВИДЕЛ ЧТО СКРЫВАЛИ НА ДНЕ БАЙКАЛА! О ЧЕМ МОЛЧАЛ СССР?

КУСТО УВИДЕЛ ЧТО СКРЫВАЛИ НА ДНЕ БАЙКАЛА! О ЧЕМ МОЛЧАЛ СССР?

Уильям Детмер - Процесс Логического Мышления. Глава 1.

Уильям Детмер - Процесс Логического Мышления. Глава 1.

System Design Concepts Course and Interview Prep

System Design Concepts Course and Interview Prep

⚡️ ВСУ отвели с позиций || Зеленский принял ультиматум Путина?

⚡️ ВСУ отвели с позиций || Зеленский принял ультиматум Путина?

Но что такое нейронная сеть? | Глава 1. Глубокое обучение

Но что такое нейронная сеть? | Глава 1. Глубокое обучение

Уоррен Баффет: Если вы хотите разбогатеть, перестаньте покупать эти 5 вещей.

Уоррен Баффет: Если вы хотите разбогатеть, перестаньте покупать эти 5 вещей.

LEMAS Seminar by Professor Guannan Qu (CMU) on Locally Interdependent Multi-Agent MDP

LEMAS Seminar by Professor Guannan Qu (CMU) on Locally Interdependent Multi-Agent MDP

Why I Left Quantum Computing Research

Why I Left Quantum Computing Research

САМЫЙ СТРАШНЫЙ КРИЗИС ЗА 30 ЛЕТ / МАРАТ КАИРЛЕНОВ / ДАЮ СЛОВО!

САМЫЙ СТРАШНЫЙ КРИЗИС ЗА 30 ЛЕТ / МАРАТ КАИРЛЕНОВ / ДАЮ СЛОВО!

Когнитивные искажения и ошибки восприятия. Лекция в Ереване. День 1

Когнитивные искажения и ошибки восприятия. Лекция в Ереване. День 1

Почему ДНК литовцев шокировало ученых? Тайна их происхождения

Почему ДНК литовцев шокировало ученых? Тайна их происхождения

Белоусов продолжает позориться: новый доклад о контроле над Купянском

Белоусов продолжает позориться: новый доклад о контроле над Купянском

Понимание вибрации и резонанса

Понимание вибрации и резонанса

Как LLM могут хранить факты | Глава 7, Глубокое обучение

Как LLM могут хранить факты | Глава 7, Глубокое обучение

Всё, что вам нужно знать о видеокодеках: Часть 1

Всё, что вам нужно знать о видеокодеках: Часть 1

Если у тебя спросили «Как твои дела?» — НЕ ГОВОРИ! Ты теряешь свою силу | Еврейская мудрость

Если у тебя спросили «Как твои дела?» — НЕ ГОВОРИ! Ты теряешь свою силу | Еврейская мудрость