You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by Ovidiu-Cristian MARCU <ov...@inria.fr> on 2016/02/16 22:19:42 UTC

spark examples Analytics ConnectedComponents - keep running, nothing in output

Hi

I’m trying to run Analytics cc (ConnectedComponents) but it is running without ending.
Logs are fine, but I just keep getting Job xyz finished, reduce took some time:

...
INFO DAGScheduler: Job 29 finished: reduce at VertexRDDImpl.scala:90, took 14.828033 s
INFO DAGScheduler: Job 30 finished: reduce at VertexRDDImpl.scala:90, took 15.341294 s
..

..
INFO TaskSetManager: Finished task 299.0 in stage 53059.0 (TID 88025) in 81 ms on 172.16.99.22 (195/480)
INFO TaskSetManager: Starting task 47.0 in stage 53059.0 (TID 88075, 172.16.99.31, PROCESS_LOCAL, 5367 bytes)
..

I am using Spark 1.5 standalone and input graph http://snap.stanford.edu/data/web-BerkStan.html <http://snap.stanford.edu/data/web-BerkStan.html>

It seems there is no convergence, can you help me understand what is wrong in your example?

Thanks

Best,
Ovidiu