Posted to issues@spark.apache.org by "Alex Angelini (JIRA)" <ji...@apache.org> on 2015/08/07 19:34:45 UTC

[jira] [Created] (SPARK-9745) Application hangs when the last executor fails with dynamic allocation

Alex Angelini created SPARK-9745:
------------------------------------

             Summary: Application hangs when the last executor fails with dynamic allocation
                 Key: SPARK-9745
                 URL: https://issues.apache.org/jira/browse/SPARK-9745
             Project: Spark
          Issue Type: Bug
          Components: PySpark, Scheduler, YARN
    Affects Versions: 1.5.0
         Environment: YARN + Pyspark + Dynamic Allocation
            Reporter: Alex Angelini


When a job has only a single executor remaining and that executor dies (due to something like an OOM), the application fails to notice that there are no executors left and it hangs indefinitely.

This only happens when dynamic allocation is enabled.
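
For reference, a setup of the kind described might be launched roughly as follows. This is only a sketch: the report does not list the reporter's actual settings, so the executor bounds, the yarn-client master, the script name job.py and the helper spark_submit_args are all illustrative assumptions. The property names themselves are standard Spark configuration keys.

```python
# Hypothetical settings: the report does not state the reporter's configuration.
DYNAMIC_ALLOCATION_CONF = {
    "spark.dynamicAllocation.enabled": "true",
    # Dynamic allocation on YARN relies on the external shuffle service.
    "spark.shuffle.service.enabled": "true",
    # Illustrative bounds only; a low floor lets the executor count shrink to one.
    "spark.dynamicAllocation.minExecutors": "0",
    "spark.dynamicAllocation.maxExecutors": "4",
}


def spark_submit_args(conf, script):
    """Build a spark-submit argument list from a dict of Spark properties."""
    # yarn-client is an assumption; the report only says "YARN".
    args = ["spark-submit", "--master", "yarn-client"]
    for key in sorted(conf):
        args += ["--conf", "{}={}".format(key, conf[key])]
    args.append(script)
    return args


if __name__ == "__main__":
    # Print the command line rather than running it.
    print(" ".join(spark_submit_args(DYNAMIC_ALLOCATION_CONF, "job.py")))
```

Setting spark.dynamicAllocation.enabled to false in the same dict gives the comparison case in which, per the report, the hang does not occur.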

The following images were taken from a hung application with no executors:



