David Silver - Deep Reinforcement Learning from AlphaGo to AlphaStar (Talk back at UAlberta) Part 1

Author: AlbertaAI (Alberta Artificial Intelligence Association)

Uploaded: 2019-07-25

Views: 1741

Description:

Deep Reinforcement Learning from AlphaGo to AlphaStar
July 23, 2019 4:00 PM - 5:30 PM
1-440, Centennial Centre for Interdisciplinary Science (CCIS)

Self-learning systems have achieved remarkable success in several challenging problems for artificial intelligence by combining reinforcement learning with deep neural networks. In this talk, David Silver ('09 PhD) will describe the origins of AlphaGo, the first program to defeat a human champion in the game of Go; AlphaZero, which learned, from scratch, to also defeat the world computer champions in chess and shogi; and AlphaStar, the first program to defeat a human champion in the real-time strategy game of StarCraft.

Date & Time:
Tuesday, July 23, 2019
4:00 p.m. Public lecture
5:00 p.m. Moderated Q&A

Location:
1-440, Centennial Centre for Interdisciplinary Science (CCIS)
University of Alberta

Biography:

David Silver ('09 PhD) leads the reinforcement learning group at DeepMind. Silver's research caught the world's attention in 2016, when AlphaGo, the program he started during his PhD studying with computing science professors Rich Sutton and Martin Müller, bested the world's Go champion, Lee Sedol.

David graduated from Cambridge University in 1997 with the Addison-Wesley award. Subsequently, David co-founded the video games company Elixir Studios, where he was CTO and lead programmer, receiving several awards for technology and innovation. David returned to academia in 2004 to study for a PhD on reinforcement learning with Rich Sutton, where he co-introduced the algorithms used in the first master-level 9x9 Go programs. David was awarded a Royal Society University Research Fellowship in 2011, and subsequently became a professor at University College London.

David consulted for DeepMind from its inception, joining full-time in 2013, where he leads the reinforcement learning team. David co-led the Atari project, in which a program learned to play 50 different games directly from pixels. He is best known for leading the AlphaGo project, culminating in the first program to defeat a top professional player in the full-size game of Go, as well as the AlphaZero project (in which a program learned by itself to defeat the world's strongest chess, shogi and Go programs). These achievements have been recognized by awards such as the Marvin Minsky Medal, Royal Academy of Engineering Silver Medal, Mensa Foundation Prize, Cannes Lion Grand Prix, and several best paper awards.
