Posted to yarn-issues@hadoop.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2018/12/05 23:02:00 UTC

[jira] [Commented] (YARN-8994) Fix race condition between move app and queue cleanup in Fair Scheduler

    [ https://issues.apache.org/jira/browse/YARN-8994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16710722#comment-16710722 ] 

Hudson commented on YARN-8994:
------------------------------

SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #15564 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/15564/])
YARN-8994. Fix race condition between move app and queue cleanup in Fair (haibochen: rev 2bba329c141dea17a1a9f7112f7155e11264ccfd)
* (edit) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fair/FairScheduler.java


> Fix race condition between move app and queue cleanup in Fair Scheduler
> -----------------------------------------------------------------------
>
>                 Key: YARN-8994
>                 URL: https://issues.apache.org/jira/browse/YARN-8994
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: fairscheduler
>    Affects Versions: 3.2.0
>            Reporter: Wilfred Spiegelenburg
>            Assignee: Wilfred Spiegelenburg
>            Priority: Major
>             Fix For: 3.3.0
>
>         Attachments: YARN-8994.001.patch
>
>
> Similar to YARN-8990, and also introduced by YARN-8191, there is a race condition while moving an application. The pre-move check looks up the target queue and proceeds once it finds it. The real move then retrieves the queue and performs further checks before updating the app and queues.
> The move uses the retrieved queue object, but the queue could have become empty while the checks are performed. If the cleanup runs at that same time, the app will be moved to a deleted queue and lost.
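
The check-then-act window described above can be illustrated with a minimal, self-contained sketch. It is not taken from the attached patch or from FairScheduler.java: the class MoveRaceSketch, its Queue type, and every method name are hypothetical stand-ins, and the betweenSteps hook exists only to force the cleanup to run deterministically inside the window. The sketch assumes one plausible remedy, namely doing the lookup and the update under the same lock that the cleanup takes; the actual fix may differ.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical model of the race; none of these names come from Hadoop.
public class MoveRaceSketch {
    public static final class Queue {
        final Set<String> apps = new HashSet<>();
    }

    private final Map<String, Queue> queues = new HashMap<>();
    private final Object lock = new Object();

    public Queue getOrCreate(String name) {
        synchronized (lock) {
            return queues.computeIfAbsent(name, n -> new Queue());
        }
    }

    public void addApp(String queueName, String app) {
        synchronized (lock) {
            queues.computeIfAbsent(queueName, n -> new Queue()).apps.add(app);
        }
    }

    // Stand-in for the queue cleanup: drops every queue that has no apps.
    public int removeEmptyQueues() {
        synchronized (lock) {
            int before = queues.size();
            queues.values().removeIf(q -> q.apps.isEmpty());
            return before - queues.size();
        }
    }

    // Racy variant: the target is looked up in one critical section and used
    // in another. betweenSteps runs in the gap to simulate a concurrent cleanup.
    public boolean racyMove(String app, String from, String to, Runnable betweenSteps) {
        Queue target;
        synchronized (lock) {
            target = queues.get(to);
        }
        if (target == null) {
            return false;
        }
        betweenSteps.run();
        synchronized (lock) {
            Queue source = queues.get(from);
            if (source == null || !source.apps.remove(app)) {
                return false;
            }
            target.apps.add(app); // target may no longer be registered
            return true;
        }
    }

    // Safer variant: lookup, checks, and update share one critical section,
    // so the cleanup cannot remove the target in between.
    public boolean safeMove(String app, String from, String to) {
        synchronized (lock) {
            Queue source = queues.get(from);
            Queue target = queues.get(to);
            if (source == null || target == null || !source.apps.contains(app)) {
                return false;
            }
            source.apps.remove(app);
            target.apps.add(app);
            return true;
        }
    }

    // True if some registered queue still holds the app.
    public boolean isTracked(String app) {
        synchronized (lock) {
            return queues.values().stream().anyMatch(q -> q.apps.contains(app));
        }
    }

    public static void main(String[] args) {
        MoveRaceSketch s = new MoveRaceSketch();
        s.addApp("source", "app1");
        s.getOrCreate("target");
        boolean moved = s.racyMove("app1", "source", "target", s::removeEmptyQueues);
        System.out.println("racy move reported success: " + moved);
        System.out.println("app still tracked: " + s.isTracked("app1"));
    }
}
```

In the racy variant the move reports success even though the app ends up in a queue object that is no longer registered, which corresponds to the "lost" app in the description. In the safer variant a cleanup that ran first simply makes the move fail, and the app stays in its source queue.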



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
