Building a Multi-Cluster Privately Hosted LLM Serving Platform on Ku... Julian Bright & Noah Yoshida

Author: CNCF [Cloud Native Computing Foundation]

Uploaded: 2023-11-13

Views: 2634

Description:

Building a Multi-Cluster Privately Hosted LLM Serving Platform on Kubernetes - Julian Bright & Noah Yoshida, Predibase

Large language models (LLMs) have taken the tech industry by storm thanks to their powerful capabilities and their accessibility through APIs like ChatGPT. However, hosting your own LLM can be very challenging because of the models' large size and GPU resource requirements. In this session, we will take you through our journey at Predibase in building a cloud-agnostic, privately hosted LLM serving platform on Kubernetes. We will cover in detail the architecture of our control plane and our data plane secured with an Istio service mesh, as well as our use of KEDA for event-driven autoscaling to support serverless inference of open-source models. By the end of the talk, attendees will have a better understanding of some of the challenges in deploying LLMs and of how to apply some of the tools and techniques we adopted in their own organizations.
