Posted to commits@airflow.apache.org by "kasim (Jira)" <ji...@apache.org> on 2019/10/28 04:27:00 UTC

[jira] [Created] (AIRFLOW-5795) Airflow cached old Code and Variables

kasim created AIRFLOW-5795:
------------------------------

             Summary: Airflow cached old Code and Variables
                 Key: AIRFLOW-5795
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-5795
             Project: Apache Airflow
          Issue Type: Bug
          Components: DAG, database
    Affects Versions: 1.10.3
            Reporter: kasim


My DAG's start_date is configured through an Airflow Variable:

{code:python}
from datetime import datetime
from airflow.models import Variable

class Config(object):
    # Defaults; any matching key in the Variable below overwrites these.
    version = "V21"

    dag_start_date = datetime(2019, 10, 25)
    sf_schedule_report = "30 8 * * *"
    sf_schedule_etl = '30 1 * * *'
    sf_schedule_main = "45 3,4,5,6,7,8 * * *"


CONFIG_KEY = 'sf_config_%s' % Config.version

# Read the JSON config from the Variable at DAG-parse time.
sf_config = Variable.get(CONFIG_KEY, deserialize_json=True, default_var={})

if sf_config:
    for k, v in sf_config.items():
        print(f'Overwrite {k} by {v}')
        if hasattr(Config, k):
            if k == 'dag_start_date':
                setattr(Config, k, datetime.strptime(v, '%Y-%m-%d'))
            else:
                setattr(Config, k, v)

print('#### config ### \n\n')

print(f'CONFIG_KEY: {CONFIG_KEY}')
print(f'CONFIG DICT: {sf_config}')

print('#### config end ### \n\n')
{code}
My DAG is initialized as:
{code:python}
dag = DAG('dm_sf_etl_%s' % Config.version,
    start_date=Config.dag_start_date,
    default_args=default_args,
    schedule_interval=schedule,
    user_defined_filters={
        'mod': lambda s, d: s % d
    },
)
{code}
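
For context, the `mod` filter above is exposed to Jinja templates. A hypothetical usage sketch follows (the operator and task_id are illustrative, not from my DAG, though the log below does print `divisor: 5` and `ti.execution_date.hour`); it assumes the `dag` object from the snippet above:
{code:python}
from airflow.operators.bash_operator import BashOperator

# Hypothetical example: pipe a value through the 'mod' filter defined above.
example = BashOperator(
    task_id='mod_filter_example',
    bash_command="echo {{ execution_date.hour | mod(5) }}",
    dag=dag,
)
{code}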
The Variable's value is shown in this screenshot:

!image-2019-10-28-12-25-56-300.png!
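
For readers of the plain-text archive, the Variable's value (as echoed later in the log) is:
{code:python}
{'dag_start_date': '2019-10-25',
 'version': 'V21',
 'sf_schedule_report': '30 9 * * *',
 'sf_schedule_etl': '40 2 * * *',
 'sf_schedule_main': '45 2,3,4,5,6,7 * * *'}
{code}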

But the log shows:
 * Airflow tried 4 times (four separate "attempt 1 of 1" runs).
 * It did not read the Variable on the first attempt.
 * It used a cached start_date of `2019-10-17` and schedule `45 1,2,3,4,5,6 * * *` from somewhere (these are the old Variable settings from 10 days ago).
 * It did read the new Variable on the later attempts.

I have tried deleting the DAG files, deleting the DAG in the web UI, and deleting the related task_instance rows in the database, but I still get the same error.

I think there is some cache in the scheduler's memory. I cannot restart Airflow, so I cannot confirm this, but it should not happen either way.
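
One way to check without restarting anything is to parse the DAG file in a fresh interpreter and compare the result with what the scheduler used; a minimal diagnostic sketch (the file path and dag_id are taken from the log below):
{code:python}
# Minimal diagnostic sketch: re-parse the DAG file in a fresh process and
# print the values it produces, to compare against the scheduler's runs.
from airflow.models import DagBag

dagbag = DagBag('/opt/workflow/airflow/dags/sf_dags_n/dm_sf_main_V21.py')
dag = dagbag.get_dag('dm_sf_main_V21')
print(dag.start_date, dag.schedule_interval)
{code}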

{code}
*** Reading local file: /data/opt/workflow/airflow/logs/dm_sf_main_V21/branch_task/2019-10-17T01:45:00+08:00/1.log
[2019-10-17 02:45:15,628] {__init__.py:1139} INFO - Dependencies all met for <TaskInstance: dm_sf_main_V21.branch_task 2019-10-17T01:45:00+08:00 [queued]>
[2019-10-17 02:45:15,644] {__init__.py:1139} INFO - Dependencies all met for <TaskInstance: dm_sf_main_V21.branch_task 2019-10-17T01:45:00+08:00 [queued]>
[2019-10-17 02:45:15,644] {__init__.py:1353} INFO - 
--------------------------------------------------------------------------------
[2019-10-17 02:45:15,645] {__init__.py:1354} INFO - Starting attempt 1 of 1
[2019-10-17 02:45:15,645] {__init__.py:1355} INFO - 
--------------------------------------------------------------------------------
[2019-10-17 02:45:15,702] {__init__.py:1374} INFO - Executing <Task(BranchPythonOperator): branch_task> on 2019-10-17T01:45:00+08:00
[2019-10-17 02:45:15,702] {base_task_runner.py:119} INFO - Running: ['airflow', 'run', 'dm_sf_main_V21', 'branch_task', '2019-10-17T01:45:00+08:00', '--job_id', '154142', '--raw', '-sd', 'DAGS_FOLDER/sf_dags_n/dm_sf_main.py', '--cfg_path', '/tmp/tmp4p1ml6pq']
[2019-10-17 02:45:16,180] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task [2019-10-17 02:45:16,179] {settings.py:182} INFO - settings.configure_orm(): Using pool settings. pool_size=5, pool_recycle=1800, pid=17786
[2019-10-17 02:45:16,341] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task [2019-10-17 02:45:16,340] {__init__.py:51} INFO - Using executor CeleryExecutor
[2019-10-17 02:45:16,643] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task [2019-10-17 02:45:16,642] {__init__.py:305} INFO - Filling up the DagBag from /opt/workflow/airflow/dags/sf_dags_n/dm_sf_main.py
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task 
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task schedule: 45 1,2,3,4,5,6 * * *
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task divisor: 5
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task 
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task ###  found sales_forecast_prophet in operator_pyspark_conf, overwrite default conf !
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task 
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task     ## Job[sales_forecast_prophet] Command-line :
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task     -------
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task     
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task     spark-submit    xxxxx
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task 
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task     -------
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task     
[2019-10-17 02:45:16,704] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task ###  found sales_forecast_prophet in operator_pyspark_conf, overwrite default conf !
[2019-10-17 02:45:16,705] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task 
[2019-10-17 02:45:16,705] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task     ## Job[sales_forecast_prophet] Command-line :
[2019-10-17 02:45:16,705] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task     -------
[2019-10-17 02:45:16,705] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task     
[2019-10-17 02:45:16,705] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task     spark-submit    xxxxx
[2019-10-17 02:45:16,705] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task 
[2019-10-17 02:45:16,705] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task     -------
[2019-10-17 02:45:16,705] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task     
[2019-10-17 02:45:16,705] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task Config.forecast_output_path: /xxxxx/data/dm/sales_forecast/results/fbprophet/version=V21/cur_time={{ execution_date.strftime('%Y%m%d') }}
[2019-10-17 02:45:16,705] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task Config.s3_forecast_output_path: s3_path/data/dm/sales_forecast/fbprophet/version=V21/cur_time={{ execution_date.strftime('%Y%m%d') }}
[2019-10-17 02:45:16,705] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task export HADOOP_USER_NAME=hdfs & kinit -kt /etc/security/hdfs.keytab hdfs & hadoop distcp -update -delete /xxxxx/data/dm/sales_forecast/results/fbprophet/version=V21/cur_time={{ execution_date.strftime('%Y%m%d') }} s3_path/data/dm/sales_forecast/fbprophet/version=V21/cur_time={{ execution_date.strftime('%Y%m%d') }}
[2019-10-17 02:45:16,705] {base_task_runner.py:101} INFO - Job 154142: Subtask branch_task [2019-10-17 02:45:16,704] {cli.py:517} INFO - Running <TaskInstance: dm_sf_main_V21.branch_task 2019-10-17T01:45:00+08:00 [running]> on host dc36
[2019-10-17 02:45:16,728] {python_operator.py:104} INFO - Exporting the following env vars:
AIRFLOW_CTX_DAG_ID=dm_sf_main_V21
AIRFLOW_CTX_TASK_ID=branch_task
AIRFLOW_CTX_EXECUTION_DATE=2019-10-17T01:45:00+08:00
AIRFLOW_CTX_DAG_RUN_ID=scheduled__2019-10-17T01:45:00+08:00
[2019-10-17 02:45:16,728] {logging_mixin.py:95} INFO - ti.execution_date.hour: 1
[2019-10-17 02:45:16,728] {python_operator.py:113} INFO - Done. Returned value was: sales_forecast_prophet_V21
[2019-10-17 02:45:16,728] {python_operator.py:143} INFO - Following branch ['sales_forecast_prophet_V21']
[2019-10-17 02:45:16,729] {python_operator.py:144} INFO - Marking other directly downstream tasks as skipped
[2019-10-17 02:45:16,828] {python_operator.py:163} INFO - Done.
[2019-10-17 02:45:20,634] {logging_mixin.py:95} INFO - [2019-10-17 02:45:20,633] {jobs.py:2562} INFO - Task exited with return code 0
[2019-10-27 19:16:28,950] {__init__.py:1139} INFO - Dependencies all met for <TaskInstance: dm_sf_main_V21.branch_task 2019-10-17T01:45:00+08:00 [queued]>
[2019-10-27 19:16:28,960] {__init__.py:1139} INFO - Dependencies all met for <TaskInstance: dm_sf_main_V21.branch_task 2019-10-17T01:45:00+08:00 [queued]>
[2019-10-27 19:16:28,960] {__init__.py:1353} INFO - 
--------------------------------------------------------------------------------
[2019-10-27 19:16:28,960] {__init__.py:1354} INFO - Starting attempt 1 of 1
[2019-10-27 19:16:28,961] {__init__.py:1355} INFO - 
--------------------------------------------------------------------------------
[2019-10-27 19:16:29,035] {__init__.py:1374} INFO - Executing <Task(BranchPythonOperator): branch_task> on 2019-10-17T01:45:00+08:00
[2019-10-27 19:16:29,036] {base_task_runner.py:119} INFO - Running: ['airflow', 'run', 'dm_sf_main_V21', 'branch_task', '2019-10-17T01:45:00+08:00', '--job_id', '421646', '--raw', '-sd', 'DAGS_FOLDER/sf_dags_n/dm_sf_main.py', '--cfg_path', '/tmp/tmpn1ew5nup']
[2019-10-27 19:16:29,511] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task [2019-10-27 19:16:29,511] {settings.py:182} INFO - settings.configure_orm(): Using pool settings. pool_size=5, pool_recycle=1800, pid=28949
[2019-10-27 19:16:29,680] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task [2019-10-27 19:16:29,679] {__init__.py:51} INFO - Using executor CeleryExecutor
[2019-10-27 19:16:29,961] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task [2019-10-27 19:16:29,959] {__init__.py:305} INFO - Filling up the DagBag from /opt/workflow/airflow/dags/sf_dags_n/dm_sf_main.py
[2019-10-27 19:16:30,052] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task Overwrite dag_start_date by 2019-10-25
[2019-10-27 19:16:30,053] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task Overwrite version by V21
[2019-10-27 19:16:30,053] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task Overwrite sf_schedule_report by 30 9 * * *
[2019-10-27 19:16:30,053] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task Overwrite sf_schedule_etl by 40 2 * * *
[2019-10-27 19:16:30,053] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task Overwrite sf_schedule_main by 45 2,3,4,5,6,7 * * *
[2019-10-27 19:16:30,054] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task #### config ### 
[2019-10-27 19:16:30,054] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task 
[2019-10-27 19:16:30,054] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task 
[2019-10-27 19:16:30,054] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task CONFIG_KEY: sf_config_V21
[2019-10-27 19:16:30,054] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task CONFIG DICT: {'dag_start_date': '2019-10-25', 'version': 'V21', 'sf_schedule_report': '30 9 * * *', 'sf_schedule_etl': '40 2 * * *', 'sf_schedule_main': '45 2,3,4,5,6,7 * * *'}
[2019-10-27 19:16:30,054] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task #### config end ### 
[2019-10-27 19:16:30,054] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task 
[2019-10-27 19:16:30,054] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task 
[2019-10-27 19:16:30,055] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task 
[2019-10-27 19:16:30,055] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task schedule: 45 2,3,4,5,6,7 * * *
[2019-10-27 19:16:30,055] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task divisor: 5
[2019-10-27 19:16:30,055] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task 
[2019-10-27 19:16:30,055] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task ###  found sales_forecast_prophet in operator_pyspark_conf, overwrite default conf !
[2019-10-27 19:16:30,055] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task 
[2019-10-27 19:16:30,055] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task     ## Job[sales_forecast_prophet] Command-line :
[2019-10-27 19:16:30,055] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task     -------
[2019-10-27 19:16:30,055] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task     
[2019-10-27 19:16:30,055] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task     spark-submit      xxxxx
[2019-10-27 19:16:30,056] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task 
[2019-10-27 19:16:30,056] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task     -------
[2019-10-27 19:16:30,056] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task     
[2019-10-27 19:16:30,056] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task ###  found sales_forecast_prophet in operator_pyspark_conf, overwrite default conf !
[2019-10-27 19:16:30,056] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task 
[2019-10-27 19:16:30,056] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task     ## Job[sales_forecast_prophet] Command-line :
[2019-10-27 19:16:30,056] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task     -------
[2019-10-27 19:16:30,056] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task     
[2019-10-27 19:16:30,056] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task     spark-submit     xxxxx
[2019-10-27 19:16:30,057] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task 
[2019-10-27 19:16:30,057] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task     -------
[2019-10-27 19:16:30,057] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task     
[2019-10-27 19:16:30,057] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task export HADOOP_USER_NAME=hdfs && kinit -kt /etc/security/hdfs.keytab hdfs && hadoop distcp -update -delete /xxxxx/data/dm/sales_forecast/results/fbprophet/version=V21/cur_time={{ execution_date.strftime('%Y%m%d') }} s3_path/data/dm/sales_forecast/fbprophet/version=V21/cur_time={{ execution_date.strftime('%Y%m%d') }}
[2019-10-27 19:16:30,057] {base_task_runner.py:101} INFO - Job 421646: Subtask branch_task [2019-10-27 19:16:30,051] {cli.py:517} INFO - Running <TaskInstance: dm_sf_main_V21.branch_task 2019-10-17T01:45:00+08:00 [running]> on host dc36
[2019-10-27 19:16:30,086] {python_operator.py:104} INFO - Exporting the following env vars:
AIRFLOW_CTX_DAG_ID=dm_sf_main_V21
AIRFLOW_CTX_TASK_ID=branch_task
AIRFLOW_CTX_EXECUTION_DATE=2019-10-17T01:45:00+08:00
AIRFLOW_CTX_DAG_RUN_ID=scheduled__2019-10-17T01:45:00+08:00
[2019-10-27 19:16:30,086] {logging_mixin.py:95} INFO - ti.execution_date.hour: 1
[2019-10-27 19:16:30,086] {python_operator.py:113} INFO - Done. Returned value was: sales_forecast_prophet
[2019-10-27 19:16:30,086] {python_operator.py:143} INFO - Following branch ['sales_forecast_prophet']
[2019-10-27 19:16:30,087] {python_operator.py:144} INFO - Marking other directly downstream tasks as skipped
[2019-10-27 19:16:30,245] {python_operator.py:163} INFO - Done.
[2019-10-27 19:16:33,935] {logging_mixin.py:95} INFO - [2019-10-27 19:16:33,933] {jobs.py:2562} INFO - Task exited with return code 0
[2019-10-27 22:59:50,913] {__init__.py:1139} INFO - Dependencies all met for <TaskInstance: dm_sf_main_V21.branch_task 2019-10-17T01:45:00+08:00 [queued]>
[2019-10-27 22:59:50,927] {__init__.py:1139} INFO - Dependencies all met for <TaskInstance: dm_sf_main_V21.branch_task 2019-10-17T01:45:00+08:00 [queued]>
[2019-10-27 22:59:50,927] {__init__.py:1353} INFO - 
--------------------------------------------------------------------------------
[2019-10-27 22:59:50,928] {__init__.py:1354} INFO - Starting attempt 1 of 1
[2019-10-27 22:59:50,928] {__init__.py:1355} INFO - 
--------------------------------------------------------------------------------
[2019-10-27 22:59:50,983] {__init__.py:1374} INFO - Executing <Task(BranchPythonOperator): branch_task> on 2019-10-17T01:45:00+08:00
[2019-10-27 22:59:50,984] {base_task_runner.py:119} INFO - Running: ['airflow', 'run', 'dm_sf_main_V21', 'branch_task', '2019-10-17T01:45:00+08:00', '--job_id', '428425', '--raw', '-sd', 'DAGS_FOLDER/sf_dags_n/dm_sf_main.py', '--cfg_path', '/tmp/tmperygo54p']
[2019-10-27 22:59:51,432] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task [2019-10-27 22:59:51,431] {settings.py:182} INFO - settings.configure_orm(): Using pool settings. pool_size=5, pool_recycle=1800, pid=15555
[2019-10-27 22:59:51,584] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task [2019-10-27 22:59:51,584] {__init__.py:51} INFO - Using executor CeleryExecutor
[2019-10-27 22:59:51,868] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task [2019-10-27 22:59:51,866] {__init__.py:305} INFO - Filling up the DagBag from /opt/workflow/airflow/dags/sf_dags_n/dm_sf_main.py
[2019-10-27 22:59:51,955] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task Overwrite dag_start_date by 2019-10-25
[2019-10-27 22:59:51,957] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task Overwrite sf_schedule_report by 30 9 * * *
[2019-10-27 22:59:51,957] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task Overwrite sf_schedule_etl by 40 2 * * *
[2019-10-27 22:59:51,957] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task Overwrite sf_schedule_main by 45 2,3,4,5,6,7 * * *
[2019-10-27 22:59:51,957] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task #### config ### 
[2019-10-27 22:59:51,957] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task 
[2019-10-27 22:59:51,957] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task 
[2019-10-27 22:59:51,958] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task CONFIG_KEY: sf_config_V21
[2019-10-27 22:59:51,958] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task CONFIG DICT: {'dag_start_date': '2019-10-25', 'xxxx': 'xxxxxx', 'sf_schedule_report': '30 9 * * *', 'sf_schedule_etl': '40 2 * * *', 'sf_schedule_main': '45 2,3,4,5,6,7 * * *'}
[2019-10-27 22:59:51,958] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task #### config end ### 
[2019-10-27 22:59:51,958] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task 
[2019-10-27 22:59:51,958] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task 
[2019-10-27 22:59:51,958] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task 
[2019-10-27 22:59:51,958] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task schedule: 45 2,3,4,5,6,7 * * *
[2019-10-27 22:59:51,958] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task divisor: 5
[2019-10-27 22:59:51,958] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task 
[2019-10-27 22:59:51,958] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task ###  found sales_forecast_prophet in operator_pyspark_conf, overwrite default conf !
[2019-10-27 22:59:51,959] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task 
[2019-10-27 22:59:51,959] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task     ## Job[sales_forecast_prophet] Command-line :
[2019-10-27 22:59:51,959] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task     -------
[2019-10-27 22:59:51,959] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task     
[2019-10-27 22:59:51,959] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task     spark-submit   xxxxx
[2019-10-27 22:59:51,959] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task 
[2019-10-27 22:59:51,959] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task     -------
[2019-10-27 22:59:51,959] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task     
[2019-10-27 22:59:51,959] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task ###  found sales_forecast_prophet in operator_pyspark_conf, overwrite default conf !
[2019-10-27 22:59:51,960] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task 
[2019-10-27 22:59:51,960] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task     ## Job[sales_forecast_prophet] Command-line :
[2019-10-27 22:59:51,960] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task     -------
[2019-10-27 22:59:51,960] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task     
[2019-10-27 22:59:51,960] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task     spark-submit   xxxxxx
[2019-10-27 22:59:51,960] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task 
[2019-10-27 22:59:51,960] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task     -------
[2019-10-27 22:59:51,960] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task     
[2019-10-27 22:59:51,960] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task export HADOOP_USER_NAME=hdfs && kinit -kt /etc/security/hdfs.keytab hdfs && hadoop distcp -update -delete /xxxxx/data/dm/sales_forecast/results/fbprophet/version=V21/cur_time={{ execution_date.strftime('%Y%m%d') }} s3_path/data/dm/sales_forecast/fbprophet/version=V21/cur_time={{ execution_date.strftime('%Y%m%d') }}
[2019-10-27 22:59:51,960] {base_task_runner.py:101} INFO - Job 428425: Subtask branch_task [2019-10-27 22:59:51,955] {cli.py:517} INFO - Running <TaskInstance: dm_sf_main_V21.branch_task 2019-10-17T01:45:00+08:00 [running]> on host dc36
[2019-10-27 22:59:51,992] {python_operator.py:104} INFO - Exporting the following env vars:
AIRFLOW_CTX_DAG_ID=dm_sf_main_V21
AIRFLOW_CTX_TASK_ID=branch_task
AIRFLOW_CTX_EXECUTION_DATE=2019-10-17T01:45:00+08:00
AIRFLOW_CTX_DAG_RUN_ID=scheduled__2019-10-17T01:45:00+08:00
[2019-10-27 22:59:51,992] {logging_mixin.py:95} INFO - ti.execution_date.hour: 1
[2019-10-27 22:59:51,992] {python_operator.py:113} INFO - Done. Returned value was: sales_forecast_prophet
[2019-10-27 22:59:51,992] {python_operator.py:143} INFO - Following branch ['sales_forecast_prophet']
[2019-10-27 22:59:51,993] {python_operator.py:144} INFO - Marking other directly downstream tasks as skipped
[2019-10-27 22:59:52,043] {python_operator.py:163} INFO - Done.
[2019-10-27 22:59:56,127] {logging_mixin.py:95} INFO - [2019-10-27 22:59:56,125] {jobs.py:2562} INFO - Task exited with return code 0
[2019-10-28 11:48:58,546] {__init__.py:1139} INFO - Dependencies all met for <TaskInstance: dm_sf_main_V21.branch_task 2019-10-17T01:45:00+08:00 [queued]>
[2019-10-28 11:48:58,589] {__init__.py:1139} INFO - Dependencies all met for <TaskInstance: dm_sf_main_V21.branch_task 2019-10-17T01:45:00+08:00 [queued]>
[2019-10-28 11:48:58,589] {__init__.py:1353} INFO - 
--------------------------------------------------------------------------------
[2019-10-28 11:48:58,590] {__init__.py:1354} INFO - Starting attempt 1 of 1
[2019-10-28 11:48:58,590] {__init__.py:1355} INFO - 
--------------------------------------------------------------------------------
[2019-10-28 11:48:58,646] {__init__.py:1374} INFO - Executing <Task(BranchPythonOperator): branch_task> on 2019-10-17T01:45:00+08:00
[2019-10-28 11:48:58,646] {base_task_runner.py:119} INFO - Running: ['airflow', 'run', 'dm_sf_main_V21', 'branch_task', '2019-10-17T01:45:00+08:00', '--job_id', '449843', '--raw', '-sd', 'DAGS_FOLDER/sf_dags_n/dm_sf_main_V21.py', '--cfg_path', '/tmp/tmp088wf7mr']
[2019-10-28 11:48:59,108] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task [2019-10-28 11:48:59,107] {settings.py:182} INFO - settings.configure_orm(): Using pool settings. pool_size=5, pool_recycle=1800, pid=1312
[2019-10-28 11:48:59,262] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task [2019-10-28 11:48:59,261] {__init__.py:51} INFO - Using executor CeleryExecutor
[2019-10-28 11:48:59,538] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task [2019-10-28 11:48:59,536] {__init__.py:305} INFO - Filling up the DagBag from /opt/workflow/airflow/dags/sf_dags_n/dm_sf_main_V21.py
[2019-10-28 11:48:59,618] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task Overwrite dag_start_date by 2019-10-25
[2019-10-28 11:48:59,620] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task Overwrite sf_schedule_report by 30 9 * * *
[2019-10-28 11:48:59,620] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task Overwrite sf_schedule_etl by 40 2 * * *
[2019-10-28 11:48:59,620] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task Overwrite sf_schedule_main by 45 2,3,4,5,6,7 * * *
[2019-10-28 11:48:59,620] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task #### config ### 
[2019-10-28 11:48:59,620] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task 
[2019-10-28 11:48:59,620] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task 
[2019-10-28 11:48:59,620] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task CONFIG_KEY: sf_config_V21
[2019-10-28 11:48:59,620] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task CONFIG DICT: {'dag_start_date': '2019-10-25', 'xxxxx': 'xxxxx', 'sf_schedule_report': '30 9 * * *', 'sf_schedule_etl': '40 2 * * *', 'sf_schedule_main': '45 2,3,4,5,6,7 * * *'}
[2019-10-28 11:48:59,620] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task #### config end ### 
[2019-10-28 11:48:59,621] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task 
[2019-10-28 11:48:59,621] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task 
[2019-10-28 11:48:59,621] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task 
[2019-10-28 11:48:59,621] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task schedule: 45 2,3,4,5,6,7 * * *
[2019-10-28 11:48:59,621] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task divisor: 5
[2019-10-28 11:48:59,621] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task 
[2019-10-28 11:48:59,621] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task ###  found sales_forecast_prophet in operator_pyspark_conf, overwrite default conf !
[2019-10-28 11:48:59,621] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task 
[2019-10-28 11:48:59,621] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task     ## Job[sales_forecast_prophet] Command-line :
[2019-10-28 11:48:59,622] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task     -------
[2019-10-28 11:48:59,622] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task     
[2019-10-28 11:48:59,622] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task     spark-submit    xxxxxx
[2019-10-28 11:48:59,622] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task 
[2019-10-28 11:48:59,622] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task     -------
[2019-10-28 11:48:59,622] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task     
[2019-10-28 11:48:59,622] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task ###  found sales_forecast_prophet in operator_pyspark_conf, overwrite default conf !
[2019-10-28 11:48:59,622] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task 
[2019-10-28 11:48:59,622] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task     ## Job[sales_forecast_prophet] Command-line :
[2019-10-28 11:48:59,622] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task     -------
[2019-10-28 11:48:59,623] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task     
[2019-10-28 11:48:59,623] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task     spark-submit      xxxxxxx
[2019-10-28 11:48:59,623] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task 
[2019-10-28 11:48:59,623] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task     -------
[2019-10-28 11:48:59,623] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task     
[2019-10-28 11:48:59,623] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task export HADOOP_USER_NAME=hdfs && kinit -kt /etc/security/hdfs.keytab hdfs && hadoop distcp -update -delete /xxxxx/data/dm/sales_forecast/results/fbprophet/version=V21/cur_time={{ execution_date.strftime('%Y%m%d') }} s3_path/data/dm/sales_forecast/fbprophet/version=V21/cur_time={{ execution_date.strftime('%Y%m%d') }}
[2019-10-28 11:48:59,623] {base_task_runner.py:101} INFO - Job 449843: Subtask branch_task [2019-10-28 11:48:59,618] {cli.py:517} INFO - Running <TaskInstance: dm_sf_main_V21.branch_task 2019-10-17T01:45:00+08:00 [running]> on host dc36
[2019-10-28 11:48:59,652] {python_operator.py:104} INFO - Exporting the following env vars:
AIRFLOW_CTX_DAG_ID=dm_sf_main_V21
AIRFLOW_CTX_TASK_ID=branch_task
AIRFLOW_CTX_EXECUTION_DATE=2019-10-17T01:45:00+08:00
AIRFLOW_CTX_DAG_RUN_ID=scheduled__2019-10-17T01:45:00+08:00
[2019-10-28 11:48:59,652] {logging_mixin.py:95} INFO - ti.execution_date.hour: 1
[2019-10-28 11:48:59,652] {python_operator.py:113} INFO - Done. Returned value was: sales_forecast_prophet
[2019-10-28 11:48:59,652] {python_operator.py:143} INFO - Following branch ['sales_forecast_prophet']
[2019-10-28 11:48:59,653] {python_operator.py:144} INFO - Marking other directly downstream tasks as skipped
[2019-10-28 11:48:59,704] {python_operator.py:163} INFO - Done.
[2019-10-28 11:49:03,532] {logging_mixin.py:95} INFO - [2019-10-28 11:49:03,530] {jobs.py:2562} INFO - Task exited with return code 0
{code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)