Популярное

Музыка Кино и Анимация Автомобили Животные Спорт Путешествия Игры Юмор

Интересные видео

2025 Сериалы Трейлеры Новости Как сделать Видеоуроки Diy своими руками

Топ запросов

смотреть а4 schoolboy runaway турецкий сериал смотреть мультфильмы эдисон
dTub
Скачать

Scaling AI inference with open source ft. Brian Stevens | Technically Speaking with Chris Wright

Автор: Red Hat

Загружено: 2025-06-04

Просмотров: 1238

Описание:

How are enterprises re-imagining AI for real-world impact? Chris Wright, Red Hat CTO and SVP Global Engineering sits down with Brian Stevens, Red Hat SVP and AI CTO, to discuss the journey towards production-quality AI inference at scale. They explore the critical role of open source projects like vLLM, the evolution from CPU to GPU optimization, and the parallels between today's AI challenges and the early days of enterprise Linux.

00:00:37 - Brian Stevens on returning to Red Hat & parallels with early Linux
00:02:00 - The path from cloud to AI & the impact of ChatGPT
00:03:58 - Pivoting to GPUs & the rise of vLLM for generative AI
00:05:48 - From CPU sparsification to GPU model compression
00:08:00 - Optimizing for modern GPUs with vLLM
00:11:38 - An ""AI Operating System""? Integrating vLLM with Kubernetes
00:15:31 - vLLM: A common platform for diverse AI hardware & models
00:17:41 - The importance of distributed KV cache for scalable inference
00:22:53 - Inference-time scaling, reasoning, and platform Demands
00:25:10 - Ecosystem & Community: The key to AI's future

Learn More:
Red Hat AI Solutions: https://www.redhat.com/en/products/ai
vLLM Project: https://docs.vllm.ai/
vLLM GitHub: https://github.com/vllm-project/vllm

Follow us:
Chris Wright (LinkedIn):   / chris-wright-b733851  
Brian Stevens (LinkedIn):   / brianmarkstevens  

What is Technically Speaking?
Technically Speaking taps into emerging technology trends with insights from leading experts across the globe and Red Hat CTO Chris Wright. The series blends deep-dive discussions, tech updates, and creative short-form content, solidifying Red Hat’s role as a pioneer in technology innovation and open source thought leadership.

Want to participate? Leave us a comment if there's a topic or a guest you'd like to see featured.

Watch More Technically Speaking:
YouTube playlist:    • Technically Speaking with Chris Wright  
Show Page: https://www.redhat.com/en/technically...
Subscribe to Red Hat's YouTube channel: https://www.youtube.com/redhat/?sub_c...

#RedHat #TechnicallySpeaking #AIInference #vLLM #EnterpriseAI #OpenSource #BrianStevens #PracticalAI #llmd"

Scaling AI inference with open source ft. Brian Stevens | Technically Speaking with Chris Wright

Поделиться в:

Доступные форматы для скачивания:

Скачать видео mp4

  • Информация по загрузке:

Скачать аудио mp3

Похожие видео

array(10) { [0]=> object(stdClass)#6152 (5) { ["video_id"]=> int(9999999) ["related_video_id"]=> string(11) "GGokllUeDTA" ["related_video_title"]=> string(58) "If You’re Tired, This Might Be Exactly What You Need" ["posted_time"]=> string(25) "2 недели назад" ["channelName"]=> string(11) "Stay Forth " } [1]=> object(stdClass)#6125 (5) { ["video_id"]=> int(9999999) ["related_video_id"]=> string(11) "1XF-NG_35NE" ["related_video_title"]=> string(81) "What's next for AI at DeepMind, Google's artificial intelligence lab | 60 Minutes" ["posted_time"]=> string(23) "1 месяц назад" ["channelName"]=> string(10) "60 Minutes" } [2]=> object(stdClass)#6150 (5) { ["video_id"]=> int(9999999) ["related_video_id"]=> string(11) "spIquD_mBFk" ["related_video_title"]=> string(49) "The AI Math That Left Number Theorists Speechless" ["posted_time"]=> string(25) "4 недели назад" ["channelName"]=> string(14) "Curt Jaimungal" } [3]=> object(stdClass)#6157 (5) { ["video_id"]=> int(9999999) ["related_video_id"]=> string(11) "o8NPllzkFhE" ["related_video_title"]=> string(44) "The mind behind Linux | Linus Torvalds | TED" ["posted_time"]=> string(19) "9 лет назад" ["channelName"]=> string(3) "TED" } [4]=> object(stdClass)#6136 (5) { ["video_id"]=> int(9999999) ["related_video_id"]=> string(11) "KFgwXXWT7sQ" ["related_video_title"]=> string(170) "ИИ-агенты — вот что действительно изменит разработку. Пишем ИИ-агент на Python, LangChain и GigaChat" ["posted_time"]=> string(23) "1 месяц назад" ["channelName"]=> string(29) "Диджитализируй!" } [5]=> object(stdClass)#6154 (5) { ["video_id"]=> int(9999999) ["related_video_id"]=> string(11) "7j_NE6Pjv-E" ["related_video_title"]=> string(64) "Model Context Protocol (MCP), clearly explained (why it matters)" ["posted_time"]=> string(25) "3 месяца назад" ["channelName"]=> string(13) "Greg Isenberg" } [6]=> object(stdClass)#6149 (5) { ["video_id"]=> int(9999999) ["related_video_id"]=> string(11) "w87UvmMcmW4" ["related_video_title"]=> string(47) "Microsoft CEO Satya Nadella on the Future of AI" ["posted_time"]=> string(25) "4 недели назад" ["channelName"]=> string(14) "Matthew Berman" } [7]=> object(stdClass)#6159 (5) { ["video_id"]=> int(9999999) ["related_video_id"]=> string(11) "3MygnjdqNWc" ["related_video_title"]=> string(63) "Are We at the End of Ai Progress? — With Gary Marcus" ["posted_time"]=> string(23) "1 месяц назад" ["channelName"]=> string(15) "Alex Kantrowitz" } [8]=> object(stdClass)#6135 (5) { ["video_id"]=> int(9999999) ["related_video_id"]=> string(11) "B1ULvYY-0Uo" ["related_video_title"]=> string(124) "Закон сохранения энергии — величайшее заблуждение физики [Veritasium]" ["posted_time"]=> string(24) "20 часов назад" ["channelName"]=> string(10) "Vert Dider" } [9]=> object(stdClass)#6153 (5) { ["video_id"]=> int(9999999) ["related_video_id"]=> string(11) "HHdaGqCWotc" ["related_video_title"]=> string(161) "Как айтишники из Беларуси построили первый в Европе фемтех-единорог. Юрий Гурский про Flo" ["posted_time"]=> string(25) "3 недели назад" ["channelName"]=> string(28) "Это Осетинская!" } }
If You’re Tired, This Might Be Exactly What You Need

If You’re Tired, This Might Be Exactly What You Need

What's next for AI at DeepMind, Google's artificial intelligence lab | 60 Minutes

What's next for AI at DeepMind, Google's artificial intelligence lab | 60 Minutes

The AI Math That Left Number Theorists Speechless

The AI Math That Left Number Theorists Speechless

The mind behind Linux | Linus Torvalds | TED

The mind behind Linux | Linus Torvalds | TED

ИИ-агенты — вот что действительно изменит разработку. Пишем ИИ-агент на Python, LangChain и GigaChat

ИИ-агенты — вот что действительно изменит разработку. Пишем ИИ-агент на Python, LangChain и GigaChat

Model Context Protocol (MCP), clearly explained (why it matters)

Model Context Protocol (MCP), clearly explained (why it matters)

Microsoft CEO Satya Nadella on the Future of AI

Microsoft CEO Satya Nadella on the Future of AI

Are We at the End of Ai Progress? — With Gary Marcus

Are We at the End of Ai Progress? — With Gary Marcus

Закон сохранения энергии — величайшее заблуждение физики [Veritasium]

Закон сохранения энергии — величайшее заблуждение физики [Veritasium]

Как айтишники из Беларуси построили первый в Европе фемтех-единорог. Юрий Гурский про Flo

Как айтишники из Беларуси построили первый в Европе фемтех-единорог. Юрий Гурский про Flo

© 2025 dtub. Все права защищены.



  • Контакты
  • О нас
  • Политика конфиденциальности



Контакты для правообладателей: [email protected]