Accelerating AI with UALink: Open Memory Fabrics for Scalable Compute

Автор: Ultra Accelerator Link

Загружено: 2025-12-10

Просмотров: 204

Описание:

Memory, not compute, is rapidly becoming the limiting factor for scaling modern AI across GPUs, accelerators, and CPUs. While HBM delivers enormous raw bandwidth, its limited capacity and strict locality quietly cap model size and context length, forcing over-buying of GPUs just to get more memory and leaving expensive compute underutilized. Combined with the high cost of HBM, memory-bound workloads such as LLM inference become more expensive and harder to run profitably as a service.

Fabric-attached memory powered by UALink decouples memory growth from GPU count, enabling independent scaling of memory and compute at near-HBM latency. By introducing a shared memory tier that supports pooling, disaggregation, and cross-node access, UALink brings mainstream DRAM into the accelerator domain as an open, scalable resource, lowering cost per token while unlocking larger models and richer contexts.

In this webinar, UnifabriX showcased its UALink-powered Memory over Fabrics™ platform, designed to accelerate large model workloads and substantially reduce the cost per token. Attendees gained insights into the UALink architecture and discovered how fabric-attached memory unlocks powerful new approaches to KV-cache optimization and RAG acceleration, delivering significant gains in inference speed and efficiency.

Presented by: Ronen Hyatt, CEO and Chief Architect, UnifabriX

Accelerating AI with UALink: Open Memory Fabrics for Scalable Compute

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

Introducing the UALink 200G 1.0 Specification Webinar

Introducing the UALink 200G 1.0 Specification Webinar

UALink 200G 1.0 Specification Overview and Applications

UALink 200G 1.0 Specification Overview and Applications

December 19, 2025, AI-Now - Massive Investments Integrate AI - Deep Dive with Alex and Jessica

December 19, 2025, AI-Now - Massive Investments Integrate AI - Deep Dive with Alex and Jessica

Cracking The Memory Wall

Cracking The Memory Wall

Astera Labs Scaling AI with PCIe, Ethernet, and UALink Retimers

Astera Labs Scaling AI with PCIe, Ethernet, and UALink Retimers

Andrej Karpathy: Software Is Changing (Again)

Andrej Karpathy: Software Is Changing (Again)

LLM fine-tuning или ОБУЧЕНИЕ малой модели? Мы проверили!

LLM fine-tuning или ОБУЧЕНИЕ малой модели? Мы проверили!

Почему нет массовых профессиональных ИИ, как они будут развиваться и что станет итогом.

Почему нет массовых профессиональных ИИ, как они будут развиваться и что станет итогом.

Чем ОПАСЕН МАХ? Разбор приложения специалистом по кибер безопасности

Чем ОПАСЕН МАХ? Разбор приложения специалистом по кибер безопасности

Скважина, которая чуть нас не разорила. Сколько стоит вода.

Скважина, которая чуть нас не разорила. Сколько стоит вода.

«Что не так с квантовой физикой и путешествиями во времени?» – Д. Горбунов, А. Арбузов, А. Семихатов

«Что не так с квантовой физикой и путешествиями во времени?» – Д. Горбунов, А. Арбузов, А. Семихатов

Почему тебе нужен свой домашний сервер? Показываю реальный опыт HOMELAB

Почему тебе нужен свой домашний сервер? Показываю реальный опыт HOMELAB

I built a private AI mini-cluster with Framework Desktop

I built a private AI mini-cluster with Framework Desktop

Как Костко завоевал Америку

Как Костко завоевал Америку

A Systematic Approach To Designing AI Accelerator Hardware

A Systematic Approach To Designing AI Accelerator Hardware

Интернет в небе: Сергей

Интернет в небе: Сергей "Флеш" о том, как «Шахеды» и «Герберы» научились работать в одной связке

AI Accelerators: Transforming Scalability & Model Efficiency

AI Accelerators: Transforming Scalability & Model Efficiency

What “Memory Bandwidth” Hides

What “Memory Bandwidth” Hides

HBM3 In The Data Center

HBM3 In The Data Center

Чат ПГТ 5.2 - это похоронная. Самый УЖАСНЫЙ релиз в истории ИИ

Чат ПГТ 5.2 - это похоронная. Самый УЖАСНЫЙ релиз в истории ИИ