You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@yunikorn.apache.org by "Kinga Marton (Jira)" <ji...@apache.org> on 2021/06/10 10:24:00 UTC

[jira] [Reopened] (YUNIKORN-582) Consider a fallback mechanism to schedule the app in case of gang failure instead of marking the app as failed

     [ https://issues.apache.org/jira/browse/YUNIKORN-582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kinga Marton reopened YUNIKORN-582:
-----------------------------------

I am reopening this issue, since during end 2 end testing I found some issues.

> Consider a fallback mechanism to schedule the app in case of gang failure instead of marking the app as failed
> --------------------------------------------------------------------------------------------------------------
>
>                 Key: YUNIKORN-582
>                 URL: https://issues.apache.org/jira/browse/YUNIKORN-582
>             Project: Apache YuniKorn
>          Issue Type: Sub-task
>          Components: core - scheduler
>            Reporter: Ayub Pathan
>            Assignee: Kinga Marton
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.11
>
>
> Incases when the app encounters gang issues due to placeholder pod allocation(failed due to various reasons), currently yunikorn marks the app failed. 
> Instead, consider a configurable option for hard or soft gang scheduling which allows fallback mechanism to schedule the app successfully.  This needs to be brain stormed to see if this makes sense. Let us use this jira for documenting all the thoughts.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@yunikorn.apache.org
For additional commands, e-mail: issues-help@yunikorn.apache.org