Network Fabrics for AI Workloads
Автор: Open Compute Project
Загружено: 2023-11-01
Просмотров: 2573
Presented by Senthil Kumar Ganesan (Dell Technologies) & Venkatesan Mahalingam (Dell)
The networking demands are escalating as AI workloads continue to grow, particularly with the momentum of Large Language Models to support trillions of parameters. An efficient network fabric within AI processing is not just a functional necessity but vital to minimizing the training time. It enables the seamless collaboration and integration of thousands of interconnected GPUs, all working as a unified computing system to train complex models. This underscores the critical need for flawless communication and sophisticated networking solutions.
SONiC (Software for Open Networking in the Cloud) is gaining traction as a preferred Network Operating System for handling AI workloads. Its flexibility and adaptability present both thrilling opportunities and distinct challenges. This presentation will delve into the specific requirements of AI Network Fabrics, explore AI use cases that SONiC can currently address with its existing switch ASIC capabilities, and identify the features that need to be enhanced in SONiC to support additional AI applications. The discussion will be grounded in the real-world experiences of customers adopting SONiC for their AI workloads.
Доступные форматы для скачивания:
Скачать видео mp4
-
Информация по загрузке: