High Concurrency Mode for Notebooks in Pipelines for Fabric Spark
Автор: Azure Synapse Analytics
Загружено: 2024-11-19
Просмотров: 2021
With high concurrency mode, we’re bringing a game-changing way to orchestrate your data ingestion and transformation processes in enterprise workflows. Notebooks in pipelines now leverage shared, high-performance sessions, combining speed with cost-efficiency—all while maintaining top-tier security.
Imagine a pipeline with five notebooks, each running 5 minutes. Normally, the 3-minute Spark start time per step would push your pipeline to 40 minutes. With high concurrency, the total runtime drops to 28 minutes—a 30% speed improvement.
Unlock faster workflows, lower costs, and a seamless data journey with high concurrency mode. Get ready to experience the next level of pipeline orchestration! 🎉
https://blog.fabric.microsoft.com/en-...
🎙 Meet the Speakers:
👤 Guest from Product Group: Santhosh Kumar Ravindran, Senior Product Manager
Santhosh Ravindran currently leads Spark Compute and Settings for Microsoft Fabric Spark. He focuses on building capabilities that meet the data engineering needs like Spark Pools, Queueing and Scheduling, and Job orchestration for big data workloads in large enterprises using Spark. Prior to this, Santhosh was a Product Manager and Engineering Lead building Metadata scanners, access policy orchestration, lineage and data catalog systems for enterprise data estates as part of the Microsoft Governance Platform (Microsoft Purview).
LinkedIn: / thisissanthoshkumar
Twitter: / iamsanthoshkr
👤 Host: Estera Kot, Principal Product Manager at Microsoft.
LinkedIn: / esterakot
Twitter: / estera_kot
👍 Liked this video? Don't forget to hit the 'Like' button and share it with your peers!
Доступные форматы для скачивания:
Скачать видео mp4
-
Информация по загрузке: