Airflow Programming Model: Difference between revisions
Jump to navigation
Jump to search
Line 12: | Line 12: | ||
=Declaring a DAG= | =Declaring a DAG= | ||
==With <tt>@dag</tt> Decorator== | ==With <tt>@dag</tt> Decorator== | ||
The <code>@dag</code> decorator turns a Python function into a '''DAG generator function'''. <code>@dag</code> is only available in Airflow 2 and newer. The decorator can be configured with the a [[Airflow_Concepts#DAG_Configuration_Parameters|number of pre-defined parameters]]. | The <code>@dag</code> decorator turns a Python function into a '''DAG generator function'''. <code>@dag</code> is only available in Airflow 2 and newer. The decorator can be configured with the a [[Airflow_Concepts#DAG_Configuration_Parameters|number of pre-defined parameters]]. The function includes task declaration and task relationship declaration, as follows: | ||
<syntaxhighlight lang='py'> | |||
</syntaxhighlight> | |||
Revision as of 01:46, 16 July 2022
Internal
Overview
Airflow DAG Examples
requirements.txt
apache-airflow == 2.3.3
Declaring a DAG
With @dag Decorator
The @dag
decorator turns a Python function into a DAG generator function. @dag
is only available in Airflow 2 and newer. The decorator can be configured with the a number of pre-defined parameters. The function includes task declaration and task relationship declaration, as follows:
Also see:
TO PARSE:
- https://airflow.apache.org/docs/apache-airflow/stable/concepts/dags.html#the-dag-decorator
- https://airflow.apache.org/docs/apache-airflow/2.0.0/concepts.html#dag-decorator.
With DAG() Constructor
Also see:
import time
from datetime import datetime, timedelta
from airflow import DAG
from airflow.operators.empty import EmptyOperator
from airflow.operators.python import PythonOperator
default_args = {
'owner': 'somebody',
"depends_on_past": False,
"email": ["somebody@example.com"],
"email_on_failure": True,
"email_on_retry": False
}
dag = DAG('some_dag',
max_active_runs=1,
catchup=True,
start_date=datetime(2022, 7, 13, 0),
schedule_interval="* * * * *",
dagrun_timeout=timedelta(minutes=2),
default_args=default_args
)
def some_function():
print("something")
with dag:
start = EmptyOperator(task_id='start', dag=dag)
end = EmptyOperator(task_id='end', dag=dag)
some_function = PythonOperator(
task_id='some_function',
python_callable=some_function,
dag=dag
)
start >> some_function >> end
Declaring a Task
Using the PythonOperator
Using the @task Decorator
Module Management
TO PROCESS: https://airflow.apache.org/docs/apache-airflow/2.3.2/modules_management.html?highlight=import