You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2015/05/04 13:46:18 UTC

[jira] [Updated] (SPARK-5098) Number of running tasks become negative after tasks lost

     [ https://issues.apache.org/jira/browse/SPARK-5098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen updated SPARK-5098:
-----------------------------
    Target Version/s:   (was: 1.2.1, 1.3.0)

> Number of running tasks become negative after tasks lost
> --------------------------------------------------------
>
>                 Key: SPARK-5098
>                 URL: https://issues.apache.org/jira/browse/SPARK-5098
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>    Affects Versions: 1.2.0
>            Reporter: Davies Liu
>            Priority: Critical
>
> 15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on spark-worker-002.c.lofty-inn-754.internal: remote Akka client disassociated
> 15/01/06 07:26:58 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-002.c.lofty-inn-754.internal:32852] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
> 15/01/06 07:26:58 WARN TaskSetManager: Lost task 10.2 in stage 0.0 (TID 55, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
> 15/01/06 07:26:58 WARN TaskSetManager: Lost task 7.2 in stage 0.0 (TID 52, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
> 15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
> 15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
> [Stage 0:===========================================================(44 + -14) / 40]
> 15/01/06 07:27:10 ERROR TaskSchedulerImpl: Lost executor 2 on spark-worker-003.c.lofty-inn-754.internal: remote Akka client disassociated
> 15/01/06 07:27:10 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-003.c.lofty-inn-754.internal:39188] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
> 15/01/06 07:27:10 WARN TaskSetManager: Lost task 16.1 in stage 0.0 (TID 60, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
> 15/01/06 07:27:10 WARN TaskSetManager: Lost task 12.0 in stage 0.0 (TID 12, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
> 15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
> 15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
> [Stage 0:==========================================================(45 + -29) / 40]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org