Book: Data Pipelines with Apache Airflow, Bas P. Harenslak

Data Pipelines with Apache Airflow

Language: English
Binding: Paperback
Availability: In external warehouse
Ships in 3-5 days
359.30 lei

Book information

Language: English
Binding: Paperback book
Published: 2021
Pages: 425
EAN: 9781617296901
ISBN: 1617296902
Enbook ID: 33339164
Weight: 888
Dimensions: 237 x 188 x 30

Full description

Pipelines can be challenging to manage, especially when your data has to flow through a collection of application components, servers, and cloud services. Airflow lets you schedule, restart, and backfill pipelines, and its easy-to-use UI and workflows with Python scripting have users praising its incredible flexibility. Data Pipelines with Apache Airflow takes you through best practices for creating pipelines for multiple tasks, including data lakes, cloud deployments, and data science.

Data Pipelines with Apache Airflow teaches you the ins and outs of the directed acyclic graphs (DAGs) that power Airflow, and how to write your own DAGs to meet the needs of your projects. The book covers both foundational and lesser-known features, so when you’re done you’ll be set to start using Airflow for seamless data pipeline development and management.

Key Features

Framework foundation and best practices

Airflow's execution and dependency system

Testing Airflow DAGs

Running Airflow in production

For data-savvy developers, DevOps and data engineers, and system administrators with intermediate Python skills.

About the technology

Data pipelines are used to extract, transform, and load data to and from multiple sources, routing it wherever it’s needed, whether that’s visualisation tools, business intelligence dashboards, or machine learning models. Airflow streamlines the whole process, giving you one tool for programmatically developing and monitoring batch data pipelines and for integrating all the pieces you use in your data stack.

Bas Harenslak and Julian de Ruiter are data engineers with extensive experience using Airflow to develop pipelines for major companies including Heineken, Unilever, and Booking.com. Bas is a committer, and both Bas and Julian are active contributors to Apache Airflow.
