You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Matthew Farrellee (JIRA)" <ji...@apache.org> on 2014/09/21 17:28:33 UTC

[jira] [Commented] (SPARK-538) INFO spark.MesosScheduler: Ignoring update from TID 9 because its job is gone

    [ https://issues.apache.org/jira/browse/SPARK-538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14142475#comment-14142475 ] 

Matthew Farrellee commented on SPARK-538:
-----------------------------------------

this is a reasonable question for the user list, see http://spark.apache.org/community.html. i'm going to close this in favor of user list interaction. if you disagree, please re-open.

> INFO spark.MesosScheduler: Ignoring update from TID 9 because its job is gone
> -----------------------------------------------------------------------------
>
>                 Key: SPARK-538
>                 URL: https://issues.apache.org/jira/browse/SPARK-538
>             Project: Spark
>          Issue Type: Bug
>            Reporter: vince67
>
> Hi Matei,
>                Maybe I can't descibe it clearly.
>                We run masters or slaves on different machines,it is success.
>                But when we run spark.examples.SparkPi on the master , our process hangs,we have not got the result.
>                Descirption like these:
>  
>                
> 12/09/02 16:47:54 INFO spark.BoundedMemoryCache: BoundedMemoryCache.maxBytes = 339585269
> 12/09/02 16:47:54 INFO spark.CacheTrackerActor: Registered actor on port 7077
> 12/09/02 16:47:54 INFO spark.CacheTrackerActor: Started slave cache (size 323.9MB) on vince67-ThinkCentre-XXXX
> 12/09/02 16:47:54 INFO spark.MapOutputTrackerActor: Registered actor on port 7077
> 12/09/02 16:47:54 INFO spark.ShuffleManager: Shuffle dir: /tmp/spark-local-3e79b235-1b94-44d1-823b-0369f6698688/shuffle
> 12/09/02 16:47:54 INFO server.Server: jetty-7.5.3.v20111011
> 12/09/02 16:47:54 INFO server.AbstractConnector: Started SelectChannelConnector@0.0.0.0:49578 STARTING
> 12/09/02 16:47:54 INFO spark.ShuffleManager: Local URI: http://ip.ip.ip.ip:49578
> 12/09/02 16:47:55 INFO server.Server: jetty-7.5.3.v20111011
> 12/09/02 16:47:55 INFO server.AbstractConnector: Started SelectChannelConnector@0.0.0.0:49600 STARTING
> 12/09/02 16:47:55 INFO broadcast.HttpBroadcast: Broadcast server started at http://ip.ip.ip.ip:49600
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Registered as framework ID 201209021640-74572372-5050-16898-0004
> 12/09/02 16:47:55 INFO spark.SparkContext: Starting job...
> 12/09/02 16:47:55 INFO spark.CacheTracker: Registering RDD ID 1 with cache
> 12/09/02 16:47:55 INFO spark.CacheTrackerActor: Registering RDD 1 with 2 partitions
> 12/09/02 16:47:55 INFO spark.CacheTracker: Registering RDD ID 0 with cache
> 12/09/02 16:47:55 INFO spark.CacheTrackerActor: Registering RDD 0 with 2 partitions
> 12/09/02 16:47:55 INFO spark.CacheTrackerActor: Asked for current cache locations
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Final stage: Stage 0
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Parents of final stage: List()
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Missing parents: List()
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Submitting Stage 0, which has no missing parents
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Got a job with 2 tasks
> 12/09/02 16:47:55 INFO spark.MesosScheduler: Adding job with ID 0
> 12/09/02 16:47:55 INFO spark.SimpleJob: Starting task 0:0 as TID 0 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:55 INFO spark.SimpleJob: Size of task 0:0 is 1606 bytes and took 151 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:55 INFO spark.SimpleJob: Starting task 0:1 as TID 1 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:55 INFO spark.SimpleJob: Size of task 0:1 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:56 INFO spark.SimpleJob: Lost TID 0 (task 0:0)
> 12/09/02 16:47:56 INFO spark.SimpleJob: Starting task 0:0 as TID 2 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:56 INFO spark.SimpleJob: Size of task 0:0 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:56 INFO spark.SimpleJob: Lost TID 1 (task 0:1)
> 12/09/02 16:47:56 INFO spark.SimpleJob: Starting task 0:1 as TID 3 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:56 INFO spark.SimpleJob: Size of task 0:1 is 1606 bytes and took 5 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:57 INFO spark.SimpleJob: Lost TID 2 (task 0:0)
> 12/09/02 16:47:57 INFO spark.SimpleJob: Starting task 0:0 as TID 4 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:57 INFO spark.SimpleJob: Size of task 0:0 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:57 INFO spark.SimpleJob: Lost TID 3 (task 0:1)
> 12/09/02 16:47:57 INFO spark.SimpleJob: Starting task 0:1 as TID 5 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:57 INFO spark.SimpleJob: Size of task 0:1 is 1606 bytes and took 2 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:58 INFO spark.SimpleJob: Lost TID 4 (task 0:0)
> 12/09/02 16:47:58 INFO spark.SimpleJob: Starting task 0:0 as TID 6 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:58 INFO spark.SimpleJob: Size of task 0:0 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:58 INFO spark.SimpleJob: Lost TID 5 (task 0:1)
> 12/09/02 16:47:58 INFO spark.SimpleJob: Starting task 0:1 as TID 7 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:58 INFO spark.SimpleJob: Size of task 0:1 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:59 INFO spark.SimpleJob: Lost TID 6 (task 0:0)
> 12/09/02 16:47:59 INFO spark.SimpleJob: Starting task 0:0 as TID 8 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:59 INFO spark.SimpleJob: Size of task 0:0 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:47:59 INFO spark.SimpleJob: Lost TID 7 (task 0:1)
> 12/09/02 16:47:59 INFO spark.SimpleJob: Starting task 0:1 as TID 9 on slave 201209021640-74572372-5050-16898-2: lmrspark-G41MT-S2 (preferred)
> 12/09/02 16:47:59 INFO spark.SimpleJob: Size of task 0:1 is 1606 bytes and took 1 ms to serialize by spark.JavaSerializerInstance
> 12/09/02 16:48:00 INFO spark.SimpleJob: Lost TID 8 (task 0:0)
> 12/09/02 16:48:00 ERROR spark.SimpleJob: Task 0:0 failed more than 4 times; aborting job
> 12/09/02 16:48:00 INFO spark.MesosScheduler: Ignoring update from TID 9 because its job is gone
>                  Your help will be appreciate.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org