RL 1: Multi-armed Bandits 1
Автор: AI Insights - Rituraj Kaushik
Загружено: 2019-01-23
Просмотров: 14775
In this video we discuss about multi-armed bandit problem and how to solve it intuitively. This is entry point into Reinforcement Learning.
Reinforcement learning tutorial series:
1. Multi-armed Bandits: • RL 1: Multi-armed Bandits 1
2. Multi-Armed Bandits - Action value estimation: • RL 2: Multi-Armed Bandits 2 - Action value...
3. Upper confidence bound: • RL 3: Upper confidence bound (UCB) to solv...
4. Thompson Sampling: • RL 4: Thompson Sampling - Multi-armed bandits
5. Markov Decision Process - MDP: • RL 5: Markov Decision Process - MDP | Rein...
6. Policy iteration and value iteration: • RL 6: Policy iteration and value iteration...
Доступные форматы для скачивания:
Скачать видео mp4
-
Информация по загрузке: