Posted to dev@yunikorn.apache.org by "Wilfred Spiegelenburg (Jira)" <ji...@apache.org> on 2021/02/24 01:38:00 UTC

[jira] [Resolved] (YUNIKORN-520) Gang scheduling waits indefinitely for placeholder pod allocation even where is no quota left in the queue

     [ https://issues.apache.org/jira/browse/YUNIKORN-520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wilfred Spiegelenburg resolved YUNIKORN-520.
--------------------------------------------
    Resolution: Fixed

> Gang scheduling waits indefinitely for placeholder pod allocation even where is no quota left in the queue
> ----------------------------------------------------------------------------------------------------------
>
>                 Key: YUNIKORN-520
>                 URL: https://issues.apache.org/jira/browse/YUNIKORN-520
>             Project: Apache YuniKorn
>          Issue Type: Sub-task
>          Components: shim - kubernetes
>    Affects Versions: 0.10
>            Reporter: Ayub Pathan
>            Assignee: Manikandan R
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.10
>
>         Attachments: job.yaml, ns.yaml
>
>
> * Create a namespace ([^ns.yaml]) with a quota.
> * Submit a job ([^job.yaml]) where the placeholder pods' resource requests exceed the queue quota.
> YuniKorn (YK) waits indefinitely for the placeholder pods' minMember requirement to be satisfied. Gang scheduling could take action to avoid this indefinite wait when the queue's resource limits can never satisfy the request.
>  
> {noformat}
> NAME                        READY STATUS   RS CPU MEM %CPU/R %MEM/R %CPU/L %MEM/L IP                NODE                                              QOS  AGE
> batch-sleep-job-4-hzc49     0/0   Pending   0 n/a n/a    n/a    n/a    n/a    n/a n/a               n/a                                               BU   4h28m
> batch-sleep-job-4-jmqbf     0/0   Pending   0 n/a n/a    n/a    n/a    n/a    n/a n/a               n/a                                               BU   4h28m
> batch-sleep-job-4-kg7hl     0/0   Pending   0 n/a n/a    n/a    n/a    n/a    n/a n/a               n/a                                               BU   4h28m
> tg-tg1-batch-sleep-job-4-0  1/1   Running   0 n/a n/a    n/a    n/a    n/a    n/a 100.100.134.233   ip-10-192-136-201.ca-central-1.compute.internal   BU   4h28m
> tg-tg1-batch-sleep-job-4-1  1/1   Running   0 n/a n/a    n/a    n/a    n/a    n/a 100.100.87.75     ip-10-192-159-151.ca-central-1.compute.internal   BU   4h28m
> tg-tg1-batch-sleep-job-4-2  0/0   Pending   0 n/a n/a    n/a    n/a    n/a    n/a n/a               n/a                                               BU   4h28m
> {noformat}
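
The situation described above can be illustrated with a small check: if the combined request of all placeholders (minMember times the per-pod request) is larger than the queue quota, the gang can never be fully placed, so waiting is pointless. The following is only a sketch of that arithmetic and not YuniKorn's actual fix. The `TaskGroup` type, the `gang_fits_quota` function, and every number are hypothetical, since the contents of job.yaml and ns.yaml are not shown in this message; only the task group name tg1 and the count of three placeholders come from the pod listing.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class TaskGroup:
    """Hypothetical stand-in for a gang scheduling task group."""
    name: str
    min_member: int   # number of placeholder pods required
    cpu_milli: int    # CPU request per placeholder, in millicores (assumed)
    memory_mib: int   # memory request per placeholder, in MiB (assumed)


def gang_fits_quota(task_groups: Iterable[TaskGroup],
                    quota_cpu_milli: int,
                    quota_memory_mib: int) -> bool:
    """Return True if all placeholders together fit inside the queue quota."""
    groups = list(task_groups)
    total_cpu = sum(tg.min_member * tg.cpu_milli for tg in groups)
    total_mem = sum(tg.min_member * tg.memory_mib for tg in groups)
    return total_cpu <= quota_cpu_milli and total_mem <= quota_memory_mib


if __name__ == "__main__":
    # Illustrative values only: three placeholders of 1000m CPU each
    # against a queue quota of 2000m CPU.
    tg1 = TaskGroup(name="tg1", min_member=3, cpu_milli=1000, memory_mib=512)
    print(gang_fits_quota([tg1], quota_cpu_milli=2000, quota_memory_mib=4096))
```

With these assumed values two placeholders fit and the third can never be allocated, which matches the two Running and one Pending tg-tg1 pods in the listing.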



--
This message was sent by Atlassian Jira
(v8.3.4#803005)