You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Kishor Patil (JIRA)" <ji...@apache.org> on 2016/09/12 22:47:20 UTC

[jira] [Created] (SPARK-17511) Dynamic allocation race condition: Containers getting marked failed while releasing

Kishor Patil created SPARK-17511:
------------------------------------

             Summary: Dynamic allocation race condition: Containers getting marked failed while releasing
                 Key: SPARK-17511
                 URL: https://issues.apache.org/jira/browse/SPARK-17511
             Project: Spark
          Issue Type: Bug
          Components: YARN
    Affects Versions: 2.0.0, 2.0.1, 2.1.0
            Reporter: Kishor Patil


While trying to reach launch multiple containers in pool, if running executors count reaches or goes beyond the target running executors, the container is released and marked failed. This can cause many jobs to be marked failed causing overall job failure.

I will have a patch up soon after completing testing.





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org