Posted to issues@spark.apache.org by "Davies Liu (JIRA)" <ji...@apache.org> on 2015/01/06 08:31:34 UTC

[jira] [Updated] (SPARK-5098) Number of running tasks become negative after tasks lost

     [ https://issues.apache.org/jira/browse/SPARK-5098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Davies Liu updated SPARK-5098:
------------------------------
    Description: 
15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on spark-worker-002.c.lofty-inn-754.internal: remote Akka client disassociated
15/01/06 07:26:58 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-002.c.lofty-inn-754.internal:32852] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
15/01/06 07:26:58 WARN TaskSetManager: Lost task 10.2 in stage 0.0 (TID 55, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
15/01/06 07:26:58 WARN TaskSetManager: Lost task 7.2 in stage 0.0 (TID 52, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
[Stage 0:===========================================================(44 + -14) / 40]
15/01/06 07:27:10 ERROR TaskSchedulerImpl: Lost executor 2 on spark-worker-003.c.lofty-inn-754.internal: remote Akka client disassociated
15/01/06 07:27:10 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-003.c.lofty-inn-754.internal:39188] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
15/01/06 07:27:10 WARN TaskSetManager: Lost task 16.1 in stage 0.0 (TID 60, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
15/01/06 07:27:10 WARN TaskSetManager: Lost task 12.0 in stage 0.0 (TID 12, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
[Stage 0:==========================================================(45 + -29) / 40]

  was:
15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on spark-worker-002.c.lofty-inn-754.internal: remote Akka client disassociated
15/01/06 07:26:58 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-002.c.lofty-inn-754.internal:32852] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
15/01/06 07:26:58 WARN TaskSetManager: Lost task 10.2 in stage 0.0 (TID 55, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
15/01/06 07:26:58 WARN TaskSetManager: Lost task 7.2 in stage 0.0 (TID 52, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
[Stage 0:=================================================================================================================(44 + -14) / 40]
15/01/06 07:27:10 ERROR TaskSchedulerImpl: Lost executor 2 on spark-worker-003.c.lofty-inn-754.internal: remote Akka client disassociated
15/01/06 07:27:10 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-003.c.lofty-inn-754.internal:39188] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
15/01/06 07:27:10 WARN TaskSetManager: Lost task 16.1 in stage 0.0 (TID 60, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
15/01/06 07:27:10 WARN TaskSetManager: Lost task 12.0 in stage 0.0 (TID 12, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
[Stage 0:=================================================================================================================(45 + -29) / 40]


> Number of running tasks become negative after tasks lost
> --------------------------------------------------------
>
>                 Key: SPARK-5098
>                 URL: https://issues.apache.org/jira/browse/SPARK-5098
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>    Affects Versions: 1.2.0
>            Reporter: Davies Liu
>            Priority: Critical
>
> 15/01/06 07:26:58 ERROR TaskSchedulerImpl: Lost executor 6 on spark-worker-002.c.lofty-inn-754.internal: remote Akka client disassociated
> 15/01/06 07:26:58 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-002.c.lofty-inn-754.internal:32852] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
> 15/01/06 07:26:58 WARN TaskSetManager: Lost task 10.2 in stage 0.0 (TID 55, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
> 15/01/06 07:26:58 WARN TaskSetManager: Lost task 7.2 in stage 0.0 (TID 52, spark-worker-002.c.lofty-inn-754.internal): ExecutorLostFailure (executor 6 lost)
> 15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
> 15/01/06 07:26:58 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 6
> [Stage 0:===========================================================(44 + -14) / 40]
> 15/01/06 07:27:10 ERROR TaskSchedulerImpl: Lost executor 2 on spark-worker-003.c.lofty-inn-754.internal: remote Akka client disassociated
> 15/01/06 07:27:10 WARN ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkExecutor@spark-worker-003.c.lofty-inn-754.internal:39188] has failed, address is now gated for [5000] ms. Reason is: [Disassociated].
> 15/01/06 07:27:10 WARN TaskSetManager: Lost task 16.1 in stage 0.0 (TID 60, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
> 15/01/06 07:27:10 WARN TaskSetManager: Lost task 12.0 in stage 0.0 (TID 12, spark-worker-003.c.lofty-inn-754.internal): ExecutorLostFailure (executor 2 lost)
> 15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
> 15/01/06 07:27:10 ERROR SparkDeploySchedulerBackend: Asked to remove non-existent executor 2
> [Stage 0:==========================================================(45 + -29) / 40]
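
The progress lines in the log above end in a counter of the form "(A + B) / C". Going by the issue title, B appears to be the number of running tasks, with A the completed tasks and C the total; that reading is an assumption here, not something the log itself states. Under that assumption, a minimal Python sketch for flagging suspicious progress lines in captured driver output could look like this. The names PROGRESS_RE and check_progress_line are hypothetical helpers for illustration and are not part of Spark.

```python
import re

# Matches the tail of a console progress line such as "(44 + -14) / 40]".
# Assumed meaning of the groups: completed tasks, running tasks, total tasks.
PROGRESS_RE = re.compile(r"\((\d+) \+ (-?\d+)\) / (\d+)\]")


def check_progress_line(line):
    """Return (completed, running, total, problems), or None if the line
    does not contain a progress counter."""
    m = PROGRESS_RE.search(line)
    if m is None:
        return None
    completed, running, total = (int(g) for g in m.groups())
    problems = []
    if running < 0:
        problems.append("negative running count")
    if completed > total:
        problems.append("completed exceeds total")
    return completed, running, total, problems


if __name__ == "__main__":
    sample = "[Stage 0:==========(44 + -14) / 40]"
    print(check_progress_line(sample))
```

Applied to the two progress lines in this report, the check would flag both the negative second number and the first number exceeding the total of 40.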


