mixup: Beyond Empirical Risk Minimization (Paper Explained)

Автор: Yannic Kilcher

Загружено: 27 мая 2020 г.

Просмотров: 11 892 просмотра

Описание:

Neural Networks often draw hard boundaries in high-dimensional space, which makes them very brittle. Mixup is a technique that linearly interpolates between data and labels at training time and achieves much smoother and more regular class boundaries.

OUTLINE:
0:00 - Intro
0:30 - The problem with ERM
2:50 - Mixup
6:40 - Code
9:35 - Results

https://arxiv.org/abs/1710.09412

Abstract:
Large deep neural networks are powerful, but exhibit undesirable behaviors such as memorization and sensitivity to adversarial examples. In this work, we propose mixup, a simple learning principle to alleviate these issues. In essence, mixup trains a neural network on convex combinations of pairs of examples and their labels. By doing so, mixup regularizes the neural network to favor simple linear behavior in-between training examples. Our experiments on the ImageNet-2012, CIFAR-10, CIFAR-100, Google commands and UCI datasets show that mixup improves the generalization of state-of-the-art neural network architectures. We also find that mixup reduces the memorization of corrupt labels, increases the robustness to adversarial examples, and stabilizes the training of generative adversarial networks.

Authors: Hongyi Zhang, Moustapha Cisse, Yann N. Dauphin, David Lopez-Paz

Links:
YouTube: / yannickilcher
Twitter: / ykilcher
BitChute: https://www.bitchute.com/channel/yann...
Minds: https://www.minds.com/ykilcher

mixup: Beyond Empirical Risk Minimization (Paper Explained)

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

[Classic] Deep Residual Learning for Image Recognition (Paper Explained)

[Classic] Deep Residual Learning for Image Recognition (Paper Explained)

SynFlow: Pruning neural networks without any data by iteratively conserving synaptic flow

SynFlow: Pruning neural networks without any data by iteratively conserving synaptic flow

Neural Architecture Search without Training (Paper Explained)

Neural Architecture Search without Training (Paper Explained)

Big Self-Supervised Models are Strong Semi-Supervised Learners (Paper Explained)

Big Self-Supervised Models are Strong Semi-Supervised Learners (Paper Explained)

The Most Dangerous Building in Manhattan

The Most Dangerous Building in Manhattan

Computer Scientist Explains One Concept in 5 Levels of Difficulty | WIRED

Computer Scientist Explains One Concept in 5 Levels of Difficulty | WIRED

Why The World Relies On ASML For Machines That Print Chips

Why The World Relies On ASML For Machines That Print Chips

Всё, что надо понимать про авто. 10 правил профессионала.

Всё, что надо понимать про авто. 10 правил профессионала.

24 часа в городе без законов: и воздуха: жизнь на высоте при 50% кислорода

24 часа в городе без законов: и воздуха: жизнь на высоте при 50% кислорода

Wildberries отойдет друзьям Путина. Кто они и как обошли Кадырова

Wildberries отойдет друзьям Путина. Кто они и как обошли Кадырова