Posted to dev@submarine.apache.org by "Kevin Su (Jira)" <ji...@apache.org> on 2020/09/21 10:05:00 UTC
[jira] [Updated] (SUBMARINE-42) Need to support the task to run failed container does not destroy
[ https://issues.apache.org/jira/browse/SUBMARINE-42?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kevin Su updated SUBMARINE-42:
------------------------------
Target Version: 0.6.0 (was: 0.5.0)
> Need to support the task to run failed container does not destroy
> -----------------------------------------------------------------
>
> Key: SUBMARINE-42
> URL: https://issues.apache.org/jira/browse/SUBMARINE-42
> Project: Apache Submarine
> Issue Type: Improvement
> Components: YARN-native-service
> Reporter: Xun Liu
> Priority: Major
>
> Currently, when a Submarine job runs a TensorFlow algorithm and the job fails, all of its containers are destroyed.
> This leaves us no way to investigate the problem inside the container, so we need YARN to support a parameter that keeps the container from being destroyed after the task fails.
> If this feature is added, please have Yarn-service support the same interface as well, because YARN's REST interface is relatively lightweight and easy to use.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@submarine.apache.org
For additional commands, e-mail: dev-help@submarine.apache.org