You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "Kaxil Naik (JIRA)" <ji...@apache.org> on 2019/01/09 21:23:00 UTC

[jira] [Updated] (AIRFLOW-2814) Default Arg "file_process_interval" for class SchedulerJob is inconsistent with doc

     [ https://issues.apache.org/jira/browse/AIRFLOW-2814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kaxil Naik updated AIRFLOW-2814:
--------------------------------
    Fix Version/s:     (was: 2.0.0)
                   1.10.2

> Default Arg "file_process_interval" for class SchedulerJob is inconsistent with doc
> -----------------------------------------------------------------------------------
>
>                 Key: AIRFLOW-2814
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-2814
>             Project: Apache Airflow
>          Issue Type: Bug
>          Components: scheduler
>            Reporter: Xiaodong DENG
>            Assignee: Xiaodong DENG
>            Priority: Critical
>             Fix For: 1.10.2
>
>
> h2. Backgrond
> In [https://github.com/XD-DENG/incubator-airflow/blob/master/airflow/jobs.py#L592] , it was mentioned the default value of argument *file_process_interval* should be 3 minutes (*file_process_interval:* Parse and schedule each file no faster than this interval).
> The value is normally parsed from the default configuration. However, in the default config_template, its value is 0 rather than 180 seconds ([https://github.com/XD-DENG/incubator-airflow/blob/master/airflow/config_templates/default_airflow.cfg#L432] ). 
> h2. Issue
> This means that actually that each file is parsed and scheduled without letting Airflow "rest". This conflicts with the design purpose (by default let it be 180 seconds) and may affect performance significantly.
> h2. My Proposal
> Change the value in the config template from 0 to 180.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)