Q-Learning y el aprendizaje por refuerzo: Teoría y práctica con Python

Автор: Iván García-Ferreira

Загружено: 2023-01-16

Просмотров: 17454

Описание:

Q-learning es el algoritmo más simple para hacer aprendizaje por refuerzo y en este vídeo explicamos con detalle cómo es y cómo programar nuestro primer agente utilizando Q-Learning en Python

Secciones:
0:00 Introducción
0:48 Explicando el problema
1:43 Recompensas y castigos
2:48 ¿Cómo hacemos que aprenda un agente?
3:15 Fórmula de Q-Learning
4:30 Ejemplo de aplicación
5:24 Explicación de la práctica en Python
7:18 Programación del agente
24:42 Agente aprendiendo a jugar al Mountain car

Si quieres más contenido de este tipo no dejes de entrar en mi blog, donde encontrarás mucho más detalle a todos estos temas y código para que puedas empezar tus pruebas:
https://www.garcia-ferreira.es/

Q-Learning y el aprendizaje por refuerzo: Teoría y práctica con Python

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

Introduccion a Inteligencia Artificial

Introduccion a Inteligencia Artificial

LLM и GPT - как работают большие языковые модели? Визуальное введение в трансформеры

LLM и GPT - как работают большие языковые модели? Визуальное введение в трансформеры

MIT 6.S191 (2023): Reinforcement Learning

MIT 6.S191 (2023): Reinforcement Learning

Q-Learning Tutorial in Python - Reinforcement Learning

Q-Learning Tutorial in Python - Reinforcement Learning

Почему простые числа образуют эти спирали? | Теорема Дирихле и пи-аппроксимации

Почему простые числа образуют эти спирали? | Теорема Дирихле и пи-аппроксимации

Но что такое нейронная сеть? | Глава 1. Глубокое обучение

Но что такое нейронная сеть? | Глава 1. Глубокое обучение

TUTORIAL: Tu primer modelo de Machine Learning en español | Regresion Lineal con python

TUTORIAL: Tu primer modelo de Machine Learning en español | Regresion Lineal con python

Introducción a Q-Learning - con Beatriz Cabrero

Introducción a Q-Learning - con Beatriz Cabrero

Reinforcement Learning: Machine Learning Meets Control Theory

Reinforcement Learning: Machine Learning Meets Control Theory

Predicción de series temporales con machine learning

Predicción de series temporales con machine learning

MIT 6.S091: Introduction to Deep Reinforcement Learning (Deep RL)

MIT 6.S091: Introduction to Deep Reinforcement Learning (Deep RL)

Introducción al aprendizaje por refuerzo

Introducción al aprendizaje por refuerzo

El APRENDIZAJE POR REFUERZO: la guía DEFINITIVA

El APRENDIZAJE POR REFUERZO: la guía DEFINITIVA

Градиентный спуск, как обучаются нейросети | Глава 2, Глубинное обучение

Градиентный спуск, как обучаются нейросети | Глава 2, Глубинное обучение

Q-Learning: Model Free Reinforcement Learning and Temporal Difference Learning

Q-Learning: Model Free Reinforcement Learning and Temporal Difference Learning

Funciones de activación a detalle (Redes neuronales)

Funciones de activación a detalle (Redes neuronales)

Why Every Trader Needs to Know This: Dr. Thomas Starke on Machine Learning Trading

Why Every Trader Needs to Know This: Dr. Thomas Starke on Machine Learning Trading

Policy Gradient Methods | Reinforcement Learning Part 6

Policy Gradient Methods | Reinforcement Learning Part 6

BitNets: La ERA de las REDES NEURONALES de 1 BIT!

BitNets: La ERA de las REDES NEURONALES de 1 BIT!

Aprende Python para ciencia de datos

Aprende Python para ciencia de datos