Lecture 80: How FlashAttention 4 Works
Автор: GPU MODE
Загружено: 2025-10-01
Просмотров: 4357
Speaker: Charles Frye
The source code (in CuTe) for FlashAttention4 on Blackwell GPUs has recently been released for the forward pass. The following blog: https://modal.com/blog/reverse-engine... goes over their findings when reading through the source code, and changes between FA1,2,3 and now 4!
Доступные форматы для скачивания:
Скачать видео mp4
-
Информация по загрузке: