Leveraging DuckDB and Delta Lake Together

Author: MotherDuck

Uploaded: 2024-07-24

Views: 2484

Description:

@mehdio will have the pleasure of hosting Holly Smith from @Databricks to chat about all things related to Delta Lake and how its integration with DuckDB works! Get ready to quack and query table formats 👩‍💻

Resources:
DuckDB - Delta Lake documentation: https://duckdb.org/docs/extensions/de...

#duckdb #deltalake #dataengineering

--------------------------------------

Explore the powerful integration of the Delta Lake table format with DuckDB in this comprehensive technical deep dive. Joined by Holly Smith, Developer Advocate at Databricks, we uncover why standard Parquet files often fall short for modern data analytics and data engineering workflows. We discuss critical challenges such as schema enforcement, data quality control, and the complexities of handling updates and deletes in a data lake, setting the stage for how the Delta Lake open-source project provides a robust solution for your cloud data warehouse.

Discover the core architecture of Delta Lake, which enhances Parquet files with a transactional metadata layer stored in the `_delta_log` directory. This key innovation brings database-level features such as ACID transactions directly to object storage such as S3, ensuring data reliability and consistency. We'll break down how this works under the hood, including how the Delta log tracks file versions and handles operations like deletes efficiently using deletion vectors. This session explains why it's crucial for data engineers to interact with the Delta table abstraction rather than the raw Parquet files.
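To make the idea of a log that tracks file versions concrete, here is a minimal sketch in Python. It is not the Delta Lake implementation. It assumes only the basic layout of the log: JSON commit files in `_delta_log` named by zero-padded version number, each line holding one action such as `add` or `remove`. Checkpoints, deletion vectors, and schema metadata are deliberately left out, and the helper names `write_commit` and `live_files` as well as the `part-*.parquet` file names are hypothetical.

```python
import json
import tempfile
from pathlib import Path


def write_commit(table_dir, version, actions):
    """Write one simplified commit file (illustrative helper, not a Delta API)."""
    log_dir = Path(table_dir) / "_delta_log"
    log_dir.mkdir(parents=True, exist_ok=True)
    body = "\n".join(json.dumps(action) for action in actions)
    (log_dir / f"{version:020d}.json").write_text(body)


def live_files(table_dir, version=None):
    """Replay add/remove actions and return the data files visible at a version."""
    log_dir = Path(table_dir) / "_delta_log"
    # Zero-padded names mean a lexical sort is also a version sort.
    commits = sorted(p for p in log_dir.glob("*.json") if p.stem.isdigit())
    files = set()
    for commit in commits:
        if version is not None and int(commit.stem) > version:
            break
        for line in commit.read_text().splitlines():
            if not line.strip():
                continue
            action = json.loads(line)
            if "add" in action:
                files.add(action["add"]["path"])
            elif "remove" in action:
                files.discard(action["remove"]["path"])
    return files


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as table:
        # Version 0 adds two files; version 1 rewrites one of them.
        write_commit(table, 0, [{"add": {"path": "part-0.parquet"}},
                                {"add": {"path": "part-1.parquet"}}])
        write_commit(table, 1, [{"remove": {"path": "part-0.parquet"}},
                                {"add": {"path": "part-2.parquet"}}])
        print(sorted(live_files(table)))             # ['part-1.parquet', 'part-2.parquet']
        print(sorted(live_files(table, version=0)))  # ['part-0.parquet', 'part-1.parquet']
```

The sketch also shows why reading the raw Parquet files directly is risky: `part-0.parquet` may still exist in storage after version 1, but it is no longer part of the table.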

Get hands-on with practical examples showing how to query Delta Lake tables using DuckDB. We demonstrate the `delta_scan` table function for reading data from both local files and large datasets on S3, showcasing the speed DuckDB offers for local development and interactive analysis. We'll also touch on the new Delta Kernel, which aims to standardize and accelerate integrations across the data ecosystem. Learn how to leverage these tools in your workflow and see how MotherDuck can further optimize queries on your cloud data.
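As a rough illustration of querying a Delta table with `delta_scan`, the sketch below builds the SQL in Python and, optionally, runs it through the third-party `duckdb` package. The function names `delta_scan_query` and `read_delta` and any table paths passed to them are hypothetical. The exact queries shown in the video are not reproduced here.

```python
def delta_scan_query(table_path, limit=None):
    """Build a SELECT over a Delta table via the delta extension's delta_scan."""
    escaped = table_path.replace("'", "''")  # escape quotes for a SQL string literal
    query = f"SELECT * FROM delta_scan('{escaped}')"
    if limit is not None:
        query += f" LIMIT {int(limit)}"
    return query


def read_delta(table_path, limit=None):
    """Run the query with DuckDB (assumes `pip install duckdb` and network access
    the first time, so the delta extension can be installed)."""
    import duckdb  # imported lazily so the query builder works without it

    con = duckdb.connect()
    con.execute("INSTALL delta")
    con.execute("LOAD delta")
    return con.execute(delta_scan_query(table_path, limit)).fetchall()
```

A call such as `read_delta("./my_delta_table", limit=10)` would read from a local table directory. For an `s3://` path, credentials would also need to be configured in DuckDB, which this sketch leaves out.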

Finally, we look ahead at the evolving landscape of data table formats. This discussion covers the convergence of major players like Delta Lake, Apache Iceberg, and Hudi, and what it means for the future of the data warehouse. Gain valuable insights to inform your data architecture decisions, whether you're a data analyst or engineer building a scalable and efficient data platform with modern developer tools.
