You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2020/11/12 21:09:00 UTC

[jira] [Commented] (AIRFLOW-3607) Decreasing scheduler delay between tasks

    [ https://issues.apache.org/jira/browse/AIRFLOW-3607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17230942#comment-17230942 ] 

ASF subversion and git services commented on AIRFLOW-3607:
----------------------------------------------------------

Commit 2482fbe2f6eb2b3cef6a6378ba140ba109ada186 in airflow's branch refs/heads/v1-10-test from amichai07
[ https://gitbox.apache.org/repos/asf?p=airflow.git;h=2482fbe ]

[AIRFLOW-3607] Optimize dep checking when depends on past set and concurrency limit


> Decreasing scheduler delay between tasks
> ----------------------------------------
>
>                 Key: AIRFLOW-3607
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-3607
>             Project: Apache Airflow
>          Issue Type: Improvement
>          Components: scheduler
>    Affects Versions: 1.10.0, 1.10.1, 1.10.2
>         Environment: ubuntu 14.04
>            Reporter: Amichai Horvitz
>            Assignee: Amichai Horvitz
>            Priority: Major
>             Fix For: 2.0.0
>
>   Original Estimate: 336h
>  Remaining Estimate: 336h
>
> I came across the TODO in airflow/ti_deps/deps/trigger_rule_dep (line 52) that says instead of checking the query for every task let the tasks report to the dagrun. I have a dag with many tasks and the delay between tasks can rise to 10 seconds or more, I already changed the configuration, added processes and memory, checked the code and did research, profiling and other experiments. I hope that this change will make a drastic change in the delay. I would be happy to discuss this solution, the research and other solutions for this issue.  
> Thanks



--
This message was sent by Atlassian Jira
(v8.3.4#803005)