Популярное

Музыка Кино и Анимация Автомобили Животные Спорт Путешествия Игры Юмор

Интересные видео

2025 Сериалы Трейлеры Новости Как сделать Видеоуроки Diy своими руками

Топ запросов

смотреть а4 schoolboy runaway турецкий сериал смотреть мультфильмы эдисон
dTub
Скачать

AI Learns to Park - Deep Reinforcement Learning

AI

Neural Networks

Deep Learning

Reinforcement Learning

Deep Reinforcement Learning

Car

Simulation

3D

Unity

Unity3D

madewithunity

Machine Learning

ML

ANNs

ML-Agents

Unity ML-Agents

PPO

Proximal Policy Optimization

RL

Parking

Автор: Samuel Arzt

Загружено: 23 авг. 2019 г.

Просмотров: 3 053 869 просмотров

Описание:

An AI learns to park a car in a parking lot in a 3D physics simulation. The simulation was implemented using Unity's ML-Agents framework (https://unity3d.com/machine-learning). The AI consists of a deep Neural Network with 3 hidden layers of 128 neurons each. It is trained with the Proximal Policy Optimization (PPO) algorithm, which is a Reinforcement Learning approach.

Basically, the input of the Neural Network are the readings of eight depth sensors, the car's current speed and position, as well as its relative position to the target. The outputs of the Neural Network are interpreted as engine force, braking force and turning force. These outputs can be seen at the top right corner of the zoomed out camera shots.

The AI starts off with random behaviour, i.e. the Neural Network is initialized with random weights. It then gradually learns to solve the task by reacting to environment feedback accordingly. The environment tells the AI whether it is doing good or bad with positive or negative reward signals.
In this project, the AI is rewarded with small positive signals for getting closer to the parking spot, which is outlined in red, and gets a larger reward when it actually reaches the parking spot and stops there. The final reward for reaching the parking spot is dependent on how parallel the car stops in relation to the actual parking position. If the car stops in a 90° angle to the actual parking direction for instance, the AI will only be rewarded a very small amount, relative to the amount it would get for stopping completely parallel to the actual direction.
The AI is penalized with a negative reward signal, when it either drives further away from the parking spot or if it crashes into any obstacles.

The training process shown in this video took about 23 hours on a computer with an i5 (7th or 8th gen) and a GTX 1070 with 100x simulation speed.

Subscribe for more content like this:
   / @samuelarzt  

Follow me on Twitter for more frequent updates on my projects:
  / samuelarzt  

Also check out my other videos related to this Project:

Two AI fight for the same Parking Spot:
   • Two AI Fight for the same Parking Spot  

Neural Networks Explained in a Minute:
   • Explained In A Minute: Neural Networks  

Cars learn to maneuver Parcour with Genetic Algorithm:
   • Deep Learning Cars  

Start Music: "Sunday" by Otis McDonald

Music from Bensound.com:
Timelapse Music: "The Elevator Bossa Nova"
Comedic Background: "Jazz Comedy"
Outro: "All That"

#ArtificialIntelligence #MachineLearning #ReinforcementLearning #AI #NeuralNetworks

AI Learns to Park - Deep Reinforcement Learning

Поделиться в:

Доступные форматы для скачивания:

Скачать видео mp4

  • Информация по загрузке:

Скачать аудио mp3

Похожие видео

An introduction to Reinforcement Learning

An introduction to Reinforcement Learning

Аниме: Чёрный призыватель все серии подряд 1-12 | аниме марафон

Аниме: Чёрный призыватель все серии подряд 1-12 | аниме марафон

Focus music ⚡ 30 minute Pomodoro deep work session 🍅 Music for maximum focus by Brain.fm

Focus music ⚡ 30 minute Pomodoro deep work session 🍅 Music for maximum focus by Brain.fm

Возможно ли Пройти Майнкрафт с Острова?

Возможно ли Пройти Майнкрафт с Острова?

AI Learns Parallel Parking - Deep Reinforcement Learning

AI Learns Parallel Parking - Deep Reinforcement Learning

Introduction to Multi-Agent Reinforcement Learning

Introduction to Multi-Agent Reinforcement Learning

5 Pieces by Hans Zimmer \\ Iconic Soundtracks \\ Relaxing Piano [20min]

5 Pieces by Hans Zimmer \\ Iconic Soundtracks \\ Relaxing Piano [20min]

سورة يسٓ كاملة للشيخ ياسر الدوسري من ليالي رمضان عام 1442 هـ Surah Yaseen

سورة يسٓ كاملة للشيخ ياسر الدوسري من ليالي رمضان عام 1442 هـ Surah Yaseen

Evolving AIs - Predator vs Prey, who will win?

Evolving AIs - Predator vs Prey, who will win?

Таймер 20 Минут

Таймер 20 Минут

© 2025 dtub. Все права защищены.



  • Контакты
  • О нас
  • Политика конфиденциальности



Контакты для правообладателей: [email protected]