Lecture 1, 2024, course overview: RL and DP, AlphaZero, discrete and continuous applications
Автор: Dimitri Bertsekas
Загружено: 2024-04-27
Просмотров: 5090
Slides, class notes, and related textbook material at http://web.mit.edu/dimitrib/www/RLboo...
The sound of the 1st videolecture of the 2024 class turned out to be degraded. I have instead posted the 1st video of the 2023 class, which has better sound and essentially identical content. Slides can be found at https://web.mit.edu/dimitrib/www/RLTo...
The subsequent videolectures 2-13 are from the 2024 offering of the course. The slides of the 1st lecture of 2024 can be found at https://web.mit.edu/dimitrib/www/RLTo...
Lecture Content: Course overview, AlphaZero, off-line training, on-line play, relation to Newton's method. Exact and approximate dynamic programming for deterministic problems, discrete optimization, model predictive and adaptive control, large language models via dynamic programming, approximation in value space and reinforcement learning
Доступные форматы для скачивания:
Скачать видео mp4
-
Информация по загрузке: