You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@yunikorn.apache.org by "Manikandan R (Jira)" <ji...@apache.org> on 2021/02/23 13:51:00 UTC
[jira] [Commented] (YUNIKORN-520) Gang scheduling waits
indefinitely for placeholder pod allocation even where is no quota left in
the queue
[ https://issues.apache.org/jira/browse/YUNIKORN-520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17289092#comment-17289092 ]
Manikandan R commented on YUNIKORN-520:
---------------------------------------
[~wwei] [~wilfreds] I've created a PR with necessary changes. Can you please review?
> Gang scheduling waits indefinitely for placeholder pod allocation even where is no quota left in the queue
> ----------------------------------------------------------------------------------------------------------
>
> Key: YUNIKORN-520
> URL: https://issues.apache.org/jira/browse/YUNIKORN-520
> Project: Apache YuniKorn
> Issue Type: Sub-task
> Components: shim - kubernetes
> Affects Versions: 0.10
> Reporter: Ayub Pathan
> Assignee: Weiwei Yang
> Priority: Major
> Labels: pull-request-available
> Attachments: job.yaml, ns.yaml
>
>
> * Create a namespace([^ns.yaml] with quota.
> * Submit a job([^job.yaml] where the placeholder pods resource requests are more than queue quota.
> The YK waits indefinitely for the placeholder pods minMember limit to be satisfied. Gang scheduling can action to avoid this indefinite lock situation, incase of resource limits.
>
> {noformat}
> NAME↑ READY STATUS RS CPU MEM %CPU/R %MEM/R %CPU/L %MEM/L IP NODE QOS AGE │
> │ batch-sleep-job-4-hzc49 0/0 Pending 0 n/a n/a n/a n/a n/a n/a n/a n/a BU 4h28m │
> │ batch-sleep-job-4-jmqbf 0/0 Pending 0 n/a n/a n/a n/a n/a n/a n/a n/a BU 4h28m │
> │ batch-sleep-job-4-kg7hl 0/0 Pending 0 n/a n/a n/a n/a n/a n/a n/a n/a BU 4h28m │
> │ tg-tg1-batch-sleep-job-4-0 1/1 Running 0 n/a n/a n/a n/a n/a n/a 100.100.134.233 ip-10-192-136-201.ca-central-1.compute.internal BU 4h28m │
> │ tg-tg1-batch-sleep-job-4-1 1/1 Running 0 n/a n/a n/a n/a n/a n/a 100.100.87.75 ip-10-192-159-151.ca-central-1.compute.internal BU 4h28m │
> │ tg-tg1-batch-sleep-job-4-2 0/0 Pending 0 n/a n/a n/a n/a n/a n/a n/a n/a BU 4h28m │ {noformat}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@yunikorn.apache.org
For additional commands, e-mail: issues-help@yunikorn.apache.org