GELU Activation Function in the LLM Architecture
Author: Vizuara
Uploaded: Oct 14, 2024
Views: 4,723
In this lecture, we learn about an important component of the LLM architecture: the GELU activation function and the feedforward neural network.
We cover how the GELU activation function works and why it is needed. Then we learn how to integrate the GELU activation function with the feedforward neural network.
We go through the theory and also code both the GELU activation function and the feedforward neural network.
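For readers who want a quick feel for the two pieces before watching, here is a minimal sketch in plain Python. It is not the code file from the lecture. It assumes the standard definition GELU(x) = x * Phi(x), where Phi is the standard normal CDF, plus the widely used tanh approximation. The feedforward part assumes the common GPT-style layout (expand to 4x the embedding size, apply GELU, project back), which may differ in detail from what is shown in the video. All names here (`gelu_exact`, `gelu_tanh`, `make_linear`, `feed_forward`, `emb_dim`) are hypothetical and chosen only for illustration.

```python
import math
import random


def gelu_exact(x: float) -> float:
    """GELU(x) = x * Phi(x), with Phi the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_tanh(x: float) -> float:
    """Tanh approximation of GELU (cheaper than evaluating erf)."""
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


def linear(x, weight, bias):
    """Compute W x + b for one input vector; weight is a list of rows [out][in]."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weight, bias)]


def make_linear(n_in, n_out, rng):
    """Create small random weights and zero biases (illustrative init only)."""
    weight = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
              for _ in range(n_out)]
    bias = [0.0] * n_out
    return weight, bias


def feed_forward(x, w1, b1, w2, b2):
    """Expand, apply GELU elementwise, then project back to the input size."""
    hidden = [gelu_tanh(h) for h in linear(x, w1, b1)]
    return linear(hidden, w2, b2)


rng = random.Random(0)
emb_dim = 4  # assumed toy embedding size, not a value from the lecture
w1, b1 = make_linear(emb_dim, 4 * emb_dim, rng)   # expansion layer (assumed 4x)
w2, b2 = make_linear(4 * emb_dim, emb_dim, rng)   # contraction layer

token = [0.5, -1.0, 2.0, 0.0]  # one made-up token embedding
out = feed_forward(token, w1, b1, w2, b2)
print(len(out))  # same length as the input vector
```

Because the output has the same size as the input, a block like this can be stacked repeatedly. Unlike ReLU, GELU gives small negative outputs for negative inputs instead of a hard zero.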
The key reference book, which this video series follows very closely, is Build a Large Language Model from Scratch, published by Manning Publications. All schematics and their descriptions are borrowed from this incredible book!
This book serves as a comprehensive guide to understanding and building large language models, covering key concepts, techniques, and implementations.
Affiliate links for purchasing the book will be added soon. Stay tuned for updates!
0:00 Introduction
3:58 GELU activation mathematics
8:49 Why do we use GELU?
10:59 Coding the GELU activation class
13:54 Feed forward neural network architecture
19:56 Coding the feedforward neural network class
24:26 Feedforward neural network advantages
26:32 Summary
Link to code file: https://drive.google.com/file/d/1k4Tw...
=================================================
✉️ Join our FREE Newsletter: https://vizuara.ai/our-newsletter/
=================================================
Vizuara philosophy:
As we learn the AI/ML/DL material, we will share thoughts on what is actually useful in industry and what has become irrelevant. We will also share a lot of information on which subjects contain open areas of research. Interested students can also start their research journey there.
Students who are confused or stuck in their ML journey may find that courses and offline videos are not inspiring enough. What might inspire you is seeing someone else learn and implement machine learning from scratch.
No cost. No hidden charges. Pure old school teaching and learning.
=================================================
🌟 Meet Our Team: 🌟
🎓 Dr. Raj Dandekar (MIT PhD, IIT Madras department topper)
🔗 LinkedIn: / raj-abhijit-dandekar-67a33118a
🎓 Dr. Rajat Dandekar (Purdue PhD, IIT Madras department gold medalist)
🔗 LinkedIn: / rajat-dandekar-901324b1
🎓 Dr. Sreedath Panat (MIT PhD, IIT Madras department gold medalist)
🔗 LinkedIn: / sreedath-panat-8a03b69a
🎓 Sahil Pocker (Machine Learning Engineer at Vizuara)
🔗 LinkedIn: / sahil-p-a7a30a8b
🎓 Abhijeet Singh (Software Developer at Vizuara, GSOC 24, SOB 23)
🔗 LinkedIn: / abhijeet-singh-9a1881192
🎓 Sourav Jana (Software Developer at Vizuara)
🔗 LinkedIn: / souravjana131
