Introduction to llm-d Distributed Inference on Kubernetes
Author: Christian Posta
Uploaded: 2025-05-27
Views: 894
In this quick virtual lightboard video, we walk through an introduction to llm-d, an open source distributed inference serving framework for Kubernetes.
https://llm-d.ai
llm-d uses the inference extensions to the Kubernetes Gateway API, which I covered in an earlier video:
• Quick Introduction to the Inference Extens...