Transform data lake to data lakehouse using Apache Iceberg | Real time ETL | Kafka | Data Lake
Author: BI Insights Inc
Uploaded: 2024-06-08
Views: 4702
🚀 Exciting News! Today we're transforming the open source data lake into a data lakehouse! 🌊📊
🌊 Imagine combining the best of data lakes and data warehouses into one powerful, unified system. That's exactly what a data lakehouse offers! 🌟
🔍 Key Benefits:
Scalability & Flexibility: Easily manage vast amounts of structured and unstructured data.
Cost-Efficiency: Optimize storage costs with tiered data storage.
Real-Time Analytics: Enable faster insights with integrated data processing.
Simplified Architecture: Reduce complexity by consolidating your data ecosystem.
🔗 Whether you’re dealing with big data analytics, machine learning, or real-time data processing, a data lakehouse is the innovative solution that bridges the gap between traditional data warehouses and modern data lakes.
🚀 Embrace the future of data with a data lakehouse and transform the way you handle data!
#apachekafka #datalakehouse #etl
Link to data lake GitHub repo: https://github.com/hnawaz007/pythonda...
Link to Kafka GitHub repo: https://github.com/hnawaz007/pythonda...
Link to the whole series: https://hnawaz007.github.io/datalake....
Link to Kafka Spark series: • PySpark | Apache Spark
Link to Data Lake video: • How to build on-premise Data Lake? | Build...
Link to real-time data analysis using Clickhouse and Streamlit: • Kafka Real-Time data analysis with Streaml...
Link to confluent S3 connector: https://www.confluent.io/hub/confluen...
Link to S3 connector configs: https://blog.min.io/kafka_and_minio/
Link to Apache Iceberg Deep Dive video: • Data Lakehouse workflow Apache Iceberg and...
Link to Channel's site:
https://hnawaz007.github.io/
--------------------------------------------------------------
💥Subscribe to our channel:
/ haqnawaz
📌 Links
-----------------------------------------
Follow me on social media!
🔗 GitHub: https://github.com/hnawaz007
📸 Instagram: / bi_insights_inc
📝 LinkedIn: / haq-nawaz
🔗 / hnawaz100
🚀 https://hnawaz007.github.io/
-----------------------------------------
Topics in this video (click to jump around):
==================================
0:00 - Introduction to Data Lake, Data Lakehouse, Iceberg
1:25 - Create Avro S3 Sink Connector
2:01 - Add db records for Streaming
2:08 - S3 Bucket
2:30 - Trino Create External Table
3:03 - Create Iceberg Table & Insert Data
3:38 - Iceberg DML Operations - Delete
4:13 - Iceberg Schema Evolution
5:15 - Time Travel
6:07 - Rollback
6:50 - Summary & Recap
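
For readers who want a feel for the Iceberg steps listed above (create table, delete, schema evolution, time travel, rollback), here is a minimal Python sketch that only builds the kind of Trino SQL statements those chapters revolve around. It is not the code from the video: the catalog name "iceberg", the schema "sales", the table "orders", and every column name are hypothetical placeholders. The statements assume Trino's Iceberg connector; the rollback_to_snapshot procedure in particular may be replaced by a different syntax in newer Trino releases, so check the docs for your version.

```python
# Illustrative only: builds Trino SQL strings for the Iceberg workflow.
# All identifiers below are hypothetical placeholders, not taken from the video.
CATALOG, SCHEMA, TABLE = "iceberg", "sales", "orders"
FQN = f"{CATALOG}.{SCHEMA}.{TABLE}"


def create_table_sql() -> str:
    """Create an Iceberg table (column list is an assumed example schema)."""
    return (
        f"CREATE TABLE {FQN} "
        "(order_id BIGINT, customer VARCHAR, amount DOUBLE) "
        "WITH (format = 'PARQUET')"
    )


def delete_sql(order_id: int) -> str:
    """Row-level delete, the DML operation shown in the Delete chapter."""
    return f"DELETE FROM {FQN} WHERE order_id = {int(order_id)}"


def add_column_sql(column: str, sql_type: str) -> str:
    """Schema evolution: add a column without rewriting existing data files."""
    return f"ALTER TABLE {FQN} ADD COLUMN {column} {sql_type}"


def list_snapshots_sql() -> str:
    """Query the $snapshots metadata table to find snapshot ids."""
    return (
        "SELECT snapshot_id, committed_at, operation "
        f'FROM {CATALOG}.{SCHEMA}."{TABLE}$snapshots" '
        "ORDER BY committed_at"
    )


def time_travel_sql(snapshot_id: int) -> str:
    """Time travel: read the table as of an earlier snapshot."""
    return f"SELECT * FROM {FQN} FOR VERSION AS OF {int(snapshot_id)}"


def rollback_sql(snapshot_id: int) -> str:
    """Rollback: make an earlier snapshot the current table state."""
    return (
        f"CALL {CATALOG}.system.rollback_to_snapshot("
        f"'{SCHEMA}', '{TABLE}', {int(snapshot_id)})"
    )


if __name__ == "__main__":
    # Print the statements in the order the chapters walk through them.
    for stmt in (
        create_table_sql(),
        delete_sql(7),
        add_column_sql("status", "VARCHAR"),
        list_snapshots_sql(),
        time_travel_sql(42),
        rollback_sql(42),
    ):
        print(stmt + ";")
```

The printed statements can be pasted into a Trino client once the placeholder names are swapped for real ones; the snapshot id 42 is a stand-in for a value read from the $snapshots query.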