ML Performance Reading Group Session 1: GPU Architecture, CUDA, NCCL

Автор: EleutherAI

Загружено: 2024-12-05

Просмотров: 4229

Описание:

ML Performance research paper reading group session 1 meeting (2024/11/29). This was an intro session covering prerequisite knowledge related to GPU architecture, CUDA, NCCL, and common performance bottlenecks in ML workloads.

Presenter: Daniel Vega-Myhre

ML Performance Reading Group Session 1: GPU Architecture, CUDA, NCCL

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

ML Performance Reading Group Session 2: Flash Attention

ML Performance Reading Group Session 2: Flash Attention

MultiGPU + NCCL from the authors

MultiGPU + NCCL from the authors

Stanford CS149 I Parallel Computing I 2023 I Kayvon Fatahalian and Kunle Olukotun

Stanford CS149 I Parallel Computing I 2023 I Kayvon Fatahalian and Kunle Olukotun

Делаем графические процессоры по-настоящему быстрыми: глубокий анализ эффективности тренировок

Делаем графические процессоры по-настоящему быстрыми: глубокий анализ эффективности тренировок

What is CUDA? - Computerphile

What is CUDA? - Computerphile

Understanding the Transformer Architecture

Understanding the Transformer Architecture

Data Center Networking for Administrators

Data Center Networking for Administrators

Demystifying NCCL An In depth Analysis of GPU Communication Protocols and Algorithms - Zhiyi Hu

Demystifying NCCL An In depth Analysis of GPU Communication Protocols and Algorithms - Zhiyi Hu

Getting Started with CUDA and Parallel Programming | NVIDIA GTC 2025 Session

Getting Started with CUDA and Parallel Programming | NVIDIA GTC 2025 Session

Lecture 44: NVIDIA Profiling

Lecture 44: NVIDIA Profiling

Distributed ML Talk @ UC Berkeley

Distributed ML Talk @ UC Berkeley

Lecture 67: NCCL and NVSHMEM

Lecture 67: NCCL and NVSHMEM

GTC 2022 - How CUDA Programming Works - Stephen Jones, CUDA Architect, NVIDIA

GTC 2022 - How CUDA Programming Works - Stephen Jones, CUDA Architect, NVIDIA

How do Graphics Cards Work? Exploring GPU Architecture

How do Graphics Cards Work? Exploring GPU Architecture

The Chaotic State of GPU Programming

The Chaotic State of GPU Programming

Trends in Deep Learning Hardware: Bill Dally (NVIDIA)

Trends in Deep Learning Hardware: Bill Dally (NVIDIA)

Hard и soft skills, без которых не попасть в ML

Hard и soft skills, без которых не попасть в ML

NCCL: High-Speed Inter-GPU Communication for Large-Scale Training - Sylvain Jeaugey, NVIDIA

NCCL: High-Speed Inter-GPU Communication for Large-Scale Training - Sylvain Jeaugey, NVIDIA

Tutorial: GPU Communication Libraries for Accelerating HPC and AI Applications

Tutorial: GPU Communication Libraries for Accelerating HPC and AI Applications

Learn RDMA Programming: NVIDIA’s Guide to High-Performance Networking

Learn RDMA Programming: NVIDIA’s Guide to High-Performance Networking