Posted to commits@airflow.apache.org by "Yajun Dong (JIRA)" <ji...@apache.org> on 2017/03/02 03:32:45 UTC
[jira] [Created] (AIRFLOW-934) airflow delayed the task to start
Yajun Dong created AIRFLOW-934:
----------------------------------
Summary: airflow delayed the task to start
Key: AIRFLOW-934
URL: https://issues.apache.org/jira/browse/AIRFLOW-934
Project: Apache Airflow
Issue Type: Bug
Components: DagRun
Affects Versions: Airflow 1.6.2
Reporter: Yajun Dong
Priority: Blocker
We have a complex DAG that includes many tasks, but recently we found that some tasks start late. For instance:
start_task (starts at 00:00) --> create_cluster (finishes at 00:11) --> wait_task (starts at 00:16)
Notes:
1. wait_task has only one upstream task, create_cluster.
2. The server that hosts Airflow has enough memory, and celeryd_concurrency is 20.
Below is the log of wait_task:
[2017-03-02 00:16:39,602] {models.py:124} INFO - Filling up the DagBag from /home/ubuntu/airflow/dags/etl_prod/etl_prod.py
[2017-03-02 00:16:39,603] {models.py:197} INFO - Importing /home/ubuntu/airflow/dags/etl_prod/etl_prod.py
[2017-03-02 00:16:39,613] {models.py:284} INFO - Loaded DAG <DAG: etl_prod>
[2017-03-02 00:16:40,333] {models.py:124} INFO - Filling up the DagBag from /home/ubuntu/airflow/dags/etl_prod/etl_prod.py
[2017-03-02 00:16:40,333] {models.py:197} INFO - Importing /home/ubuntu/airflow/dags/etl_prod/etl_prod.py
[2017-03-02 00:16:40,345] {models.py:284} INFO - Loaded DAG <DAG: etl_prod>
[2017-03-02 00:16:40,373] {models.py:936} INFO -
--------------------------------------------------------------------------------
New run starting @2017-03-02T00:16:40.369560
--------------------------------------------------------------------------------
[2017-03-02 00:16:40,402] {models.py:951} INFO - Queuing into pool None
[2017-03-02 00:22:31,161] {models.py:124} INFO - Filling up the DagBag from /home/ubuntu/airflow/dags/etl_prod/etl_prod.py
[2017-03-02 00:22:31,162] {models.py:197} INFO - Importing /home/ubuntu/airflow/dags/etl_prod/etl_prod.py
[2017-03-02 00:22:31,172] {models.py:284} INFO - Loaded DAG <DAG: etl_prod>
[2017-03-02 00:22:31,863] {models.py:124} INFO - Filling up the DagBag from /home/ubuntu/airflow/dags/etl_prod/etl_prod.py
[2017-03-02 00:22:31,863] {models.py:197} INFO - Importing /home/ubuntu/airflow/dags/etl_prod/etl_prod.py
[2017-03-02 00:22:31,874] {models.py:284} INFO - Loaded DAG <DAG: etl_prod>
[2017-03-02 00:22:31,901] {models.py:936} INFO -
--------------------------------------------------------------------------------
New run starting @2017-03-02T00:22:31.897547
--------------------------------------------------------------------------------
[2017-03-02 00:22:31,911] {models.py:974} INFO - Executing <Task(BashOperator): wait_mins> on 2017-03-01 00:00:00
[2017-03-02 00:22:31,922] {bash_operator.py:52} INFO - tmp dir root location:
/tmp
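
The size of the delay can be read from the log above: the task is queued ("Queuing into pool None") at 00:16:40,402 and only begins executing at 00:22:31,911. The following is a minimal sketch of that arithmetic in Python. The helper names (parse_log_time, queue_delay_seconds) are hypothetical and not part of Airflow; the two timestamps are copied from the log, and the format string assumes the timestamp layout shown there.

```python
from datetime import datetime

# Timestamps copied verbatim from the wait_task log.
QUEUED_AT = "2017-03-02 00:16:40,402"     # "Queuing into pool None"
EXECUTING_AT = "2017-03-02 00:22:31,911"  # "Executing <Task(BashOperator): wait_mins>"

# Assumed layout of the log timestamps: date, time, comma, milliseconds.
LOG_TIME_FORMAT = "%Y-%m-%d %H:%M:%S,%f"


def parse_log_time(stamp):
    """Parse a timestamp in the format used by the log lines (hypothetical helper)."""
    return datetime.strptime(stamp, LOG_TIME_FORMAT)


def queue_delay_seconds(queued, executing):
    """Return the seconds elapsed between the queued and executing timestamps."""
    return (parse_log_time(executing) - parse_log_time(queued)).total_seconds()


if __name__ == "__main__":
    delay = queue_delay_seconds(QUEUED_AT, EXECUTING_AT)
    print("Delay between queuing and execution: %.3f seconds" % delay)
```

By this calculation the task sat for roughly 5 minutes 51 seconds between being queued and starting to execute, in addition to the gap between create_cluster finishing at 00:11 and wait_task first appearing at 00:16.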
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)