You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Aaron Davidson (JIRA)" <ji...@apache.org> on 2014/05/11 00:14:36 UTC
[jira] [Created] (SPARK-1771) CoarseGrainedSchedulerBackend is not
resilient to Akka restarts
Aaron Davidson created SPARK-1771:
-------------------------------------
Summary: CoarseGrainedSchedulerBackend is not resilient to Akka restarts
Key: SPARK-1771
URL: https://issues.apache.org/jira/browse/SPARK-1771
Project: Spark
Issue Type: Bug
Components: Spark Core
Reporter: Aaron Davidson
The exception reported in SPARK-1769 was propagated through the CoarseGrainedSchedulerBackend, and caused an Actor restart of the DriverActor. Unfortunately, this actor does not seem to have been written with Akka restartability in mind. For instance, the new DriverActor has lost all state about the prior Executors without cleanly disconnecting them. This means that the driver actually has executors attached to it, but doesn't think it does, which leads to mayhem of various sorts.
--
This message was sent by Atlassian JIRA
(v6.2#6252)