You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "angerszhu (Jira)" <ji...@apache.org> on 2021/10/22 07:47:00 UTC

[jira] [Created] (SPARK-37097) yarn-cluster mode, unregister timeout cause spark retry but AM container exit with code 0

angerszhu created SPARK-37097:
---------------------------------

             Summary: yarn-cluster mode, unregister timeout cause spark retry but AM container exit with code 0
                 Key: SPARK-37097
                 URL: https://issues.apache.org/jira/browse/SPARK-37097
             Project: Spark
          Issue Type: Sub-task
          Components: SQL
    Affects Versions: 3.2.0
            Reporter: angerszhu


Cluster mode AM shutdown hook triggered, am unregister from RM timeout, but AM shutdown hook have try catch, so AM container exit with code 0. But since RM lose connection with AM, then treat this container as failed.

Then client side got application report as final status failed but am container exit code 0. Then retry.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org