Transformer Attention Explained By Example
Author: Kie Codes
Uploaded: 2024-01-18
Views: 3537
The attention mechanism is the key component of Transformer models. It has driven much of the recent progress in AI, yet it is not easy to understand. In this video, I explain the attention mechanism from scratch. We look at Scaled Dot-Product Attention and at how the Key, Query, and Value matrices are trained, and we see how these parts combine into an Attention Head, or Attention Layer. If you're interested in AI and Transformers, this video makes this important building block easy to understand.
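As a companion to the video, here is a minimal sketch of Scaled Dot-Product Attention in NumPy. It assumes the Query, Key, and Value matrices have already been produced by learned projections (training them is covered in the video); the toy shapes and variable names are illustrative, not from the video itself.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V."""
    d_k = K.shape[-1]
    # Similarity of every query to every key, scaled to keep softmax stable.
    scores = Q @ K.T / np.sqrt(d_k)
    # Softmax over the key dimension (numerically stabilized).
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    # Each output row is a weighted average of the value rows.
    return weights @ V

# Toy example: 3 tokens, head dimension 4.
rng = np.random.default_rng(0)
Q = rng.normal(size=(3, 4))
K = rng.normal(size=(3, 4))
V = rng.normal(size=(3, 4))
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # one 4-dimensional output per token: (3, 4)
```

Because the softmax weights for each token sum to 1, every output row is a convex combination of the value rows.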
📹 Video about the Transformer Architecture:
• Transformers Explained by Example
🙏 Support me: / kiecodes
🛰 Join our Discord, to interact with other Coders and me: / discord
🧠 Pick my brain: https://calendly.com/kiecodes/ai-cons...
Check out my newest video: • How to use the OpenAI API in Python
Timestamps:
00:00 Intro
00:22 What is Attention
00:55 What are Attention Layers or Attention Heads
02:55 What is Multi-Head Attention Layer
04:23 What's in an Attention Layer
05:00 The Attention Function
11:36 Normalisation
14:19 Putting it all together
16:07 Masked Attention
17:48 Cross Attention
---
This video contains advertising content.
---
Attribution:
■ Video by Tima Miroshnichenko: https://www.pexels.com/de-de/video/pe...
■ Video by Tima Miroshnichenko: https://www.pexels.com/de-de/video/ma...
■ Brain icons created by Freepik - Flaticon: https://www.flaticon.com/free-icons/b...