Airflow Concepts: Difference between revisions
Jump to navigation
Jump to search
(→Task) |
|||
Line 27: | Line 27: | ||
===<span id='Bjy4L'></span>TaskFlow-decorated Task=== | ===<span id='Bjy4L'></span>TaskFlow-decorated Task=== | ||
{{External|https://airflow.apache.org/docs/apache-airflow/stable/concepts/taskflow.html}} | {{External|https://airflow.apache.org/docs/apache-airflow/stable/concepts/taskflow.html}} | ||
==Task Assignment to DAG== | |||
{{External|https://airflow.apache.org/docs/apache-airflow/stable/concepts/dags.html#dag-assignment}} | |||
==Passing Data between Tasks== | ==Passing Data between Tasks== |
Revision as of 01:58, 11 July 2022
External
Internal
Workflow
DAG
SubDAG
A DAG is made of tasks among which there are relations of dependency. The DAG is not concerned about what happens inside the tasks, it is only concerned about how to run them: order, retries, timeouts. etc.
Task
Every task must be assigned to a DAG to run. Tasks have dependencies on each other.
Task Dependencies
If a task B has a dependency on task A (A → B), it is said that A is upstream of B and B is downstream of A. The dependencies are the directed edges of the directed acyclic graph.
Task Types
Airflow has three types of tasks: Operator, Sensor, which is a subclass of Operator, and TaskFlow-decorated Task.
Operator
Sensor
A Sensor is a subclass of Operator.
TaskFlow-decorated Task
Task Assignment to DAG
Passing Data between Tasks
Tasks pass data among each other using:
- XComs, when the amount of metadata to be exchanged is small.
- Uploading and downloading large files from a storage service.
TaskGroup
XComs
"Cross-communications".