How to create an AWS S3 Data Lake? | AWS S3 | AWS Glue | AWS Athena
Автор: BI Insights Inc
Загружено: 2022-04-20
Просмотров: 7615
In this video, we create a AWS S3 data lake. A data lake is a centralized, curated, and secured repository storing all your structured and unstructured data, at any scale. You can store your data as-is, without having first to structure it. And you can run different types of analytics to better guide decision-making.
We cover the following topics in this video:
We create a database to house our glue catalog. We define a crawler connects to a data store. This crawler progresses through the S3 objects with a csv classifiers to determine the schema for our data, and then creates metadata tables in our AWS Glue Data Catalog.
Next, query the data in the data lake using Athena. We set a default query output location for AWS Athens so it can store the query output.
In the Athena console, choose Query Editor and select the AdventureWorks database.
Choose Tables and select the Product's table.
Choose Table Options (three vertical dots to the right of the table name).
Select Preview table.
Link to medium article on this topic: https://blog.devgenius.io/build-your-...
Subscribe to our channel:
/ haqnawaz
---------------------------------------------
Follow me on social media!
GitHub: https://github.com/hnawaz007
Instagram: / bi_insights. .
LinkedIn: / haq-nawaz
---------------------------------------------
#AWS #S3 #DataLake
Topics covered in this video:
0:00 - Intro AWS S3 Data Lake and Glue
1:00 - Today's Agenda
1:22 - Create a database
2:05 - Create a crawler
3:53 - Run Crawler to create tables
4:18 - Inspect tables
4:43 - AWS Athena - define output location
5:15 - Query Data Lake in AWS Athena
Доступные форматы для скачивания:
Скачать видео mp4
-
Информация по загрузке: