Building Your Own Data Pipeline Tool From Scratch - Should You Do It?

Author: Seattle Data Guy

Uploaded: 2024-11-12

Views: 4666

Description:

Alright, let's start out with the fact that there are some distinctions between an orchestrator and a data pipeline tool.

But many data teams use Airflow either as their data pipeline tool or as the tool that orchestrates all the other tools that make up their data pipeline.

As you start building your first data pipelines, you’ll slowly realize you need to address a growing number of recurring issues. Maybe you implement a component or process that tracks which jobs are running, a scheduler, a set of generic scripts to run transforms and data ingestion, or even some form of UI.

Before you know it, you’ve pieced together something that looks like Airflow. Something that goes beyond just being a set of data pipelines but starts looking like an orchestrator.
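To make that concrete, here is a minimal sketch of how those pieces tend to come together. It is not from the video and it is not how Airflow is implemented. `Task` and `MiniOrchestrator` are hypothetical names, and the only assumption is Python 3.9+ for the standard-library `graphlib` module. It orders tasks by dependency, tracks each task's status, and skips anything downstream of a failure.

```python
from dataclasses import dataclass, field
from graphlib import TopologicalSorter
from typing import Callable, Dict, List


@dataclass
class Task:
    """Hypothetical unit of work: a name, a callable, and upstream task names."""
    name: str
    run: Callable[[], None]
    depends_on: List[str] = field(default_factory=list)


class MiniOrchestrator:
    """Toy runner (illustration only): dependency ordering plus status tracking."""

    def __init__(self) -> None:
        self.tasks: Dict[str, Task] = {}
        self.status: Dict[str, str] = {}

    def add(self, task: Task) -> None:
        self.tasks[task.name] = task
        self.status[task.name] = "pending"

    def run_all(self) -> List[str]:
        # Fail early if a task depends on something that was never registered.
        for task in self.tasks.values():
            for dep in task.depends_on:
                if dep not in self.tasks:
                    raise ValueError(f"{task.name} depends on unknown task {dep}")

        # Map each task to its predecessors; static_order yields upstream tasks first.
        graph = {name: set(t.depends_on) for name, t in self.tasks.items()}
        order = list(TopologicalSorter(graph).static_order())

        for name in order:
            task = self.tasks[name]
            # Skip a task if any upstream task did not succeed.
            if any(self.status[dep] != "success" for dep in task.depends_on):
                self.status[name] = "skipped"
                continue
            try:
                task.run()
                self.status[name] = "success"
            except Exception:
                self.status[name] = "failed"
        return order


# Usage: tasks can be registered in any order; the runner sorts them.
log: List[str] = []
orch = MiniOrchestrator()
orch.add(Task("load", lambda: log.append("load"), depends_on=["transform"]))
orch.add(Task("extract", lambda: log.append("extract")))
orch.add(Task("transform", lambda: log.append("transform"), depends_on=["extract"]))
orch.run_all()
```

Even this toy version already has an ordering step, a status table, and failure handling. Add a schedule, retries, and a UI on top, and you are well on the way to an orchestrator.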

Surprisingly (or maybe not), I’ve seen countless homegrown orchestration/data pipeline systems. It often feels like, given enough time, a team will end up building its own Airflow-esque solution.

So should you build it?

If you prefer reading, here is a written version of this - https://seattledataguy.substack.com/p...

Also, if you're looking for an orchestrator, consider checking out Mage!
https://www.mage.ai/

If you'd like to read up on my updates about the data field, then you can sign up for our newsletter here.

https://seattledataguy.substack.com/

Or check out my blog
https://www.theseattledataguy.com/

And if you want to support the channel, then you can become a paid member of my newsletter
https://seattledataguy.substack.com/s...


Tags: Data engineering projects, Data engineer project ideas, data project sources, data analytics project sources, data project portfolio

_____________________________________________________________
Subscribe: @seattledataguy
_____________________________________________________________
About me:
I have spent my career focused on all forms of data. I have focused on developing algorithms to detect fraud, reduce patient readmission and redesign insurance provider policy to help reduce the overall cost of healthcare. I have also helped develop analytics for marketing and IT operations in order to optimize limited resources such as employees and budget. I privately consult on data science and engineering problems both solo as well as with a company called Acheron Analytics. I have experience both working hands-on with technical problems as well as helping leadership teams develop strategies to maximize their data.

*I do participate in affiliate programs. If a link has an "*" by it, I may receive a small portion of the proceeds at no extra cost to you.
