You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@submarine.apache.org by "Kevin Su (Jira)" <ji...@apache.org> on 2022/01/25 08:02:00 UTC

[jira] [Resolved] (SUBMARINE-42) Need to support the task to run failed container does not destroy

     [ https://issues.apache.org/jira/browse/SUBMARINE-42?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kevin Su resolved SUBMARINE-42.
-------------------------------
    Resolution: Won't Fix

> Need to support the task to run failed container does not destroy
> -----------------------------------------------------------------
>
>                 Key: SUBMARINE-42
>                 URL: https://issues.apache.org/jira/browse/SUBMARINE-42
>             Project: Apache Submarine
>          Issue Type: Improvement
>          Components: YARN-native-service
>            Reporter: Xun Liu
>            Priority: Major
>
> Now through a Submarine job run a tensorflow algorithm, If the JOB fails, All containers will be destroyed.
> So we have no way to locate the problem in the container. So we need YARN to support one parameter, The container is not destroyed after the task fails.
> If you add this feature, please let Yarn-service also support this interface. Because YARN's REST interface is relatively lightweight and easy to use.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@submarine.apache.org
For additional commands, e-mail: dev-help@submarine.apache.org