You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "Brian King (JIRA)" <ji...@apache.org> on 2019/07/15 10:48:00 UTC
[jira] [Created] (AIRFLOW-4958) Documentation issue with cron
schedule_interval
Brian King created AIRFLOW-4958:
-----------------------------------
Summary: Documentation issue with cron schedule_interval
Key: AIRFLOW-4958
URL: https://issues.apache.org/jira/browse/AIRFLOW-4958
Project: Apache Airflow
Issue Type: Bug
Components: documentation, scheduler
Affects Versions: 1.10.3
Reporter: Brian King
The docs regarding scheduling with a cron expression ( [https://airflow.apache.org/scheduler.html#dag-runs] ) links to a wikipedia article on cron ( [https://en.wikipedia.org/wiki/Cron#CRON_expression] ), which says that the expression is comprised of 5 or 6 fields, with the last field being the year.
However, croniter, which is used by Airflow, treats the 6th field as seconds ( [https://github.com/taichino/croniter/issues/76#issuecomment-332508039] ).
Perhaps the link to cron documentation should link to the croniter documentation instead of the Wikipedia article, or the Airflow documentation should make clear that the 6th field is optional, and if used, it represents seconds.
I had a quick-running job (that finished in less than a minute) that would execute twice a day. The DAG was configured with schedule_interval = "10 2 * * * *" and max_active_runs = 1. The first job run started around 2:10:00, the second one started around 2:10:45. Removing the sixth field resulted in the job only running once per day, as I wanted it to do.
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)