2 - Deep RL and RL post-training intro
Author: Natasha Jaques
Uploaded: 2025-10-07
Views: 247
Description:
Second lecture for CSE 599J on Social Reinforcement Learning: https://courses.cs.washington.edu/cou.... An extremely fast intro to deep reinforcement learning algorithms, covering popular off-policy and on-policy algorithms and how to debug them. The lecture then moves on to RL post-training of language models and gives a brief intro to RL from Human Feedback (RLHF).
