Posted to issues@yunikorn.apache.org by "Weiwei Yang (Jira)" <ji...@apache.org> on 2021/06/05 03:57:00 UTC

[jira] [Resolved] (YUNIKORN-582) Consider a fallback mechanism to schedule the app in case of gang failure instead of marking the app as failed

     [ https://issues.apache.org/jira/browse/YUNIKORN-582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Weiwei Yang resolved YUNIKORN-582.
----------------------------------
    Fix Version/s: 0.11
       Resolution: Fixed

> Consider a fallback mechanism to schedule the app in case of gang failure instead of marking the app as failed
> --------------------------------------------------------------------------------------------------------------
>
>                 Key: YUNIKORN-582
>                 URL: https://issues.apache.org/jira/browse/YUNIKORN-582
>             Project: Apache YuniKorn
>          Issue Type: Sub-task
>          Components: core - scheduler
>            Reporter: Ayub Pathan
>            Assignee: Kinga Marton
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.11
>
>
> In cases where the app encounters gang scheduling issues because placeholder pod allocation fails (for various reasons), YuniKorn currently marks the app as failed.
> Instead, consider a configurable option for hard or soft gang scheduling that allows a fallback mechanism to schedule the app successfully. This needs to be brainstormed to see whether it makes sense. Let us use this Jira to document all the thoughts.


