Posted to yarn-issues@hadoop.apache.org by "Tao Yang (JIRA)" <ji...@apache.org> on 2018/08/01 02:21:00 UTC

[jira] [Commented] (YARN-8546) Resource leak caused by a reserved container being released more than once under async scheduling

    [ https://issues.apache.org/jira/browse/YARN-8546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16564638#comment-16564638 ] 

Tao Yang commented on YARN-8546:
--------------------------------

Thanks [~leftnoteasy]!

> Resource leak caused by a reserved container being released more than once under async scheduling
> -------------------------------------------------------------------------------------------------
>
>                 Key: YARN-8546
>                 URL: https://issues.apache.org/jira/browse/YARN-8546
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: capacity scheduler
>    Affects Versions: 3.1.0
>            Reporter: Weiwei Yang
>            Assignee: Tao Yang
>            Priority: Major
>              Labels: global-scheduling
>             Fix For: 3.2.0, 3.1.1
>
>         Attachments: YARN-8546.001.patch
>
>
> I was able to reproduce this issue by starting a job that keeps requesting containers until it uses up the cluster's available resources. My cluster has 70200 vcores and each task requests 100 vcores, so I expected a total of 702 containers to be allocated, but in the end only 701 were. The last container could not be allocated because the queue's used resource had been updated to more than 100%.
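
The arithmetic in the description can be checked with a small sketch. This is not scheduler code: the function name `containers_allocated` and the `leaked_vcores` parameter are hypothetical, and the 100-vcore over-count is only an assumed value chosen to match the reported numbers, not a statement of how the leak actually arises.

```python
def containers_allocated(total_vcores, vcores_per_container, leaked_vcores=0):
    """Count containers that fit before recorded usage would exceed capacity.

    leaked_vcores is a hypothetical amount wrongly left in the queue's
    used-resource tally (an assumption for illustration only).
    """
    used = leaked_vcores
    count = 0
    # Keep allocating while the next container still fits within capacity.
    while used + vcores_per_container <= total_vcores:
        used += vcores_per_container
        count += 1
    return count


TOTAL_VCORES = 70200      # cluster size from the report
VCORES_PER_TASK = 100     # per-task request from the report

# Accurate accounting: every container fits.
expected = containers_allocated(TOTAL_VCORES, VCORES_PER_TASK)

# Assumed over-count of one container's worth of vcores in the used tally.
observed = containers_allocated(TOTAL_VCORES, VCORES_PER_TASK, leaked_vcores=100)

print(expected, observed)
```

Under these assumptions, the last request is rejected because granting it would push the recorded usage past the queue's capacity, which is consistent with the symptom described above.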



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
