How to Scale Unstructured Data Processing with Ray Data | Ray Summit 2024

Author: Anyscale

Uploaded: 2024-10-18

Views: 784

Description:

At Ray Summit 2024, Hao Chen and Praveen Gorthy from Anyscale tackle the challenge of processing unstructured data at scale. As images, videos, and other unstructured data formats become more common, the associated data volumes grow rapidly, and traditional frameworks struggle to keep up. This talk introduces Ray Data on Anyscale as a solution to this problem.

The speakers explore Ray Data's streaming batch model and adaptive scheduling, demonstrating how these features efficiently handle the heterogeneous compute requirements of unstructured data workloads. They also highlight Anyscale's enhancements to Ray Data, including autoscaling, fault tolerance, and performance optimizations.
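To make the streaming batch idea concrete, here is a minimal conceptual sketch in plain Python. It does not use Ray Data's API and is not the speakers' implementation; the names `batched`, `stream_stage`, `decode`, `infer`, and `run_pipeline` are hypothetical, and the two stage functions are stand-ins for CPU-bound preprocessing and GPU-bound inference. The point it illustrates is that each batch flows through every stage before the next batch is read, so the full dataset is never held in memory at once.

```python
from typing import Callable, Iterable, Iterator, List

Batch = List[int]


def batched(items: Iterable[int], batch_size: int) -> Iterator[Batch]:
    """Lazily group an input stream into batches of at most batch_size."""
    batch: Batch = []
    for item in items:
        batch.append(item)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        # Emit the final, possibly smaller, batch.
        yield batch


def stream_stage(batches: Iterable[Batch], fn: Callable[[Batch], Batch]) -> Iterator[Batch]:
    """Apply fn to each batch as it arrives instead of waiting for all batches."""
    for batch in batches:
        yield fn(batch)


def decode(batch: Batch) -> Batch:
    # Hypothetical stand-in for CPU-bound work such as image decoding.
    return [x * 2 for x in batch]


def infer(batch: Batch) -> Batch:
    # Hypothetical stand-in for GPU-bound work such as model inference.
    return [x + 1 for x in batch]


def run_pipeline(source: Iterable[int], batch_size: int) -> Iterator[Batch]:
    """Chain the stages; nothing runs until the result is iterated."""
    return stream_stage(stream_stage(batched(source, batch_size), decode), infer)


results = list(run_pipeline(range(5), batch_size=2))
```

In a real distributed system the stages would run concurrently on different resource types; this single-process sketch only shows the batch-at-a-time data flow, not scheduling or autoscaling.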

A key feature of this presentation is a live demo, showcasing the development and scaling of an unstructured data processing workflow using Ray Data on Anyscale. Attendees will see firsthand how Anyscale's observability tools provide real-time insights into workload performance and resource utilization, enabling on-the-fly pipeline optimization.

This session is invaluable for data scientists, engineers, and organizations grappling with large-scale unstructured data processing, offering practical solutions to improve performance and cost-efficiency in their data pipelines.

--

Interested in more?
Watch the full Day 1 Keynote: Ray Summit 2024 Keynote Day 1 | Where Buil...
Watch the full Day 2 Keynote: Ray Summit 2024 Keynote Day 2 | Where Buil...

--

🔗 Connect with us:
Subscribe to our YouTube channel: @anyscale
Twitter: https://x.com/anyscalecompute
LinkedIn: joinanyscale
Website: https://www.anyscale.com

