Resurrecting Recurrent Neural Networks for Long Sequences | Razvan Pascanu

Автор: ICARL

Загружено: 2024-04-17

Просмотров: 1709

Описание:

ICARL Seminar Series - 2023 Winter

Resurrecting Recurrent Neural Networks for Long Sequences
Seminar by Razvan Pascanu

Abstract:
In this talk, Razvan Pascanu will focus on State Space Models (SSM), a recently introduced family of sequential models and specifically discuss the relationship between SSMs and recurrent neural networks. He will start with a short history of architecture design for language modelling, which he will use as a motivating task. This will allow to provide some insights in the evolution of RNN architectures, and why some choices behind the SSM architecture seemed counter-intuitive. Most of the talk will focus on introducing the Linear Recurrent Unit architecture, explaining the role of the various modifications from traditional non-linear recurrent models.

The talk will conclude with some open questions about the role recurrent architectures could or should play, and potentially the less well understood relationship between these SSM models and transformer like architectures.

About the Speaker

Razvan Pascanu has been a research scientist at Google DeepMind since 2014. Before this, he did his PhD in Montréal with prof. Yoshua Bengio, working on understanding deep networks, recurrent models and optimization. Since he joined DeepMind he has also had significant contributions in deep reinforcement learning, continual learning, meta-learning, graph neural networks as well as continuing his research agenda of understanding deep learning, recurrent models and optimization. Please see his scholar page for specific contributions. He is also actively promoting AI research and education as a main organizer of Conference on Life-long Learning Agents (CoLLAs) lifelong-ml.cc , Eastern European Machine Learning Summer School (EEML) www.eeml.eu and www.workshops.eeml.eu as well as different workshops at NeurIPS, ICML and ICLR.

——————————————————
Links
Razvan Pascanu
Site: https://sites.google.com/view/razp

ICARL
Site: icarl.doc.ic.ac.uk
Twitter: twitter.com/ic_arl
YouTube: @ICARLSeminars
——————————————————

Resurrecting Recurrent Neural Networks for Long Sequences | Razvan Pascanu

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

Dynamic Deep Learning | Richard Sutton

Dynamic Deep Learning | Richard Sutton

Exploring Alternative Bio-Inspired Neural Building Blocks for Fast RL | Sebastian Risi

Exploring Alternative Bio-Inspired Neural Building Blocks for Fast RL | Sebastian Risi

Challenges in Deep Learning (Dr Razvan Pascanu - DeepMind)

Challenges in Deep Learning (Dr Razvan Pascanu - DeepMind)

Краткое объяснение больших языковых моделей

Краткое объяснение больших языковых моделей

Real-world Reinforcement Learning in Multi-Agent Systems | Eugene Vinitsky

Real-world Reinforcement Learning in Multi-Agent Systems | Eugene Vinitsky

Момент, когда мы перестали понимать ИИ [AlexNet]

Момент, когда мы перестали понимать ИИ [AlexNet]

Но что такое нейронная сеть? | Глава 1. Глубокое обучение

Но что такое нейронная сеть? | Глава 1. Глубокое обучение

Новый завод по производству микросхем в Америке — катастрофа стоимостью 50 миллиардов долларов

Новый завод по производству микросхем в Америке — катастрофа стоимостью 50 миллиардов долларов

Что такое стек ИИ? Магистратура LLM, RAG и аппаратное обеспечение ИИ

Что такое стек ИИ? Магистратура LLM, RAG и аппаратное обеспечение ИИ

Почему диффузия работает лучше, чем авторегрессия?

Почему диффузия работает лучше, чем авторегрессия?

КАК ПОГИБАЕТ ЛОНДОН

КАК ПОГИБАЕТ ЛОНДОН

Passive Learning of Active Causal Strategies in Agents and Language Models | Andrew Lampinen

Passive Learning of Active Causal Strategies in Agents and Language Models | Andrew Lampinen

Армия смерти Гитлера: Дас Райх — БЕЗ ЦЕНЗУРЫ

Армия смерти Гитлера: Дас Райх — БЕЗ ЦЕНЗУРЫ

Внимание — это всё, что вам нужно (Transformer) — объяснение модели (включая математику), вывод и...

Внимание — это всё, что вам нужно (Transformer) — объяснение модели (включая математику), вывод и...

Champion-Level Drone Racing using Deep Reinforcement Learning | Leonard Bauersfeld

Champion-Level Drone Racing using Deep Reinforcement Learning | Leonard Bauersfeld

Визуализация гравитации

Визуализация гравитации

Наш интеллект УМИРАЕТ. Как ИИ разрушает сознание? | Нейробиолог Алипов, Михаил Никитин

Наш интеллект УМИРАЕТ. Как ИИ разрушает сознание? | Нейробиолог Алипов, Михаил Никитин

Тайны полифонии Баха — как работает гениальный мозг?

Тайны полифонии Баха — как работает гениальный мозг?

Как электростатические двигатели нарушают все правила

Как электростатические двигатели нарушают все правила

Вы просыпаетесь в 3 часа ночи? Вашему телу нужна помощь! Почему об этом не говорят?

Вы просыпаетесь в 3 часа ночи? Вашему телу нужна помощь! Почему об этом не говорят?