Multi-Agent Reinforcement Learning Chapter 4: Nash Equilibrium and Welfare/Fairness Criteria

Author: Jason Eckstein

Uploaded: 2025-11-25

Views: 85

Description:

Live recording of an online meeting reviewing material from "Multi-Agent Reinforcement Learning: Foundations and Modern Approaches" by Stefano V. Albrecht, Filippos Christianos, and Lukas Schäfer. In this meeting we analyze general-sum games and equilibrium solution concepts such as the Nash equilibrium. We solve the 2x2 game exactly and note some shortcomings of equilibrium solutions, including non-uniqueness. Finally, we address ways of filtering solutions by criteria that either maximize total reward across the agents or distribute rewards evenly among them. A few example games illustrate which types of solutions exist and which are desirable: chicken, battle of the sexes, prisoner's dilemma, and stag hunt.
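As a rough illustration of the 2x2 analysis described above, here is a minimal Python sketch (the meeting notes themselves use Julia). It enumerates pure Nash equilibria, solves for a fully mixed equilibrium from the indifference conditions, and scores each equilibrium by welfare and fairness. Several things here are assumptions and not taken from the recording: the function names, the chicken payoff values, and the scoring rules, where welfare is taken to be the sum of expected returns and fairness their product.

```python
from itertools import product

def pure_nash(R1, R2):
    """Return all pure-strategy Nash equilibria (row, col) of a 2x2 game."""
    eqs = []
    for a1, a2 in product(range(2), range(2)):
        # Neither player can gain by unilaterally switching actions.
        row_ok = R1[a1][a2] >= R1[1 - a1][a2]
        col_ok = R2[a1][a2] >= R2[a1][1 - a2]
        if row_ok and col_ok:
            eqs.append((a1, a2))
    return eqs

def mixed_nash(R1, R2):
    """Fully mixed equilibrium (p, q) from indifference conditions, or None.

    p = P(row plays action 0), chosen so the column player is indifferent;
    q = P(column plays action 0), chosen so the row player is indifferent.
    """
    d1 = R1[0][0] - R1[0][1] - R1[1][0] + R1[1][1]
    d2 = R2[0][0] - R2[1][0] - R2[0][1] + R2[1][1]
    if d1 == 0 or d2 == 0:
        return None
    p = (R2[1][1] - R2[1][0]) / d2
    q = (R1[1][1] - R1[0][1]) / d1
    if 0 < p < 1 and 0 < q < 1:
        return p, q
    return None

def expected_returns(R1, R2, p, q):
    """Expected return of each player when row plays 0 w.p. p, column w.p. q."""
    probs = [[p * q, p * (1 - q)], [(1 - p) * q, (1 - p) * (1 - q)]]
    u1 = sum(probs[i][j] * R1[i][j] for i in range(2) for j in range(2))
    u2 = sum(probs[i][j] * R2[i][j] for i in range(2) for j in range(2))
    return u1, u2

# Illustrative chicken payoffs (assumed values): action 0 = stay, 1 = leave.
R1 = [[0, 7], [2, 6]]  # row player's rewards
R2 = [[0, 2], [7, 6]]  # column player's rewards

# Pure equilibria are mixed strategies with probability 1 on one action.
candidates = [(float(a1 == 0), float(a2 == 0)) for a1, a2 in pure_nash(R1, R2)]
mixed = mixed_nash(R1, R2)
if mixed is not None:
    candidates.append(mixed)

for p, q in candidates:
    u1, u2 = expected_returns(R1, R2, p, q)
    # Assumed criteria: welfare = sum of returns, fairness = product of returns.
    print(f"p={p:.3f} q={q:.3f} returns=({u1:.3f}, {u2:.3f}) "
          f"welfare={u1 + u2:.3f} fairness={u1 * u2:.3f}")
```

With these assumed payoffs the game has more than one equilibrium, which is the non-uniqueness problem mentioned above, and the welfare and fairness scores give one way to rank them.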

The textbook website contains materials provided by the authors, including a PDF of the text, slides, and a GitHub repository with code.

MARL textbook website: https://www.marl-book.com/
MARL kickoff slides: https://docs.google.com/presentation/...

This online meeting is hosted through https://www.meetup.com/boulderdatasci... and https://www.meetup.com/silicon-valley...

For background material covering traditional reinforcement learning, see the following playlist: Reinforcement Learning Tutorial Meetings

Notes and interactive tools seen in those videos use the Julia language (https://julialang.org/) and the package Pluto.jl (https://plutojl.org/).

Previous meetings covered the textbook "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto. The following links point to that material and to my notes and code based on it.

Sutton and Barto Textbook: http://incompleteideas.net/book/the-b...
HTML Notes: https://jekyllstein.github.io/Reinfor...
GitHub Repository: https://github.com/jekyllstein/Reinfo...

#reinforcementlearning #education #multiplayergames
