You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@spark.apache.org by bhusted <br...@gmail.com> on 2014/09/25 21:28:55 UTC

Re: K-means faster on Mahout then on Spark

What is the size of your vector mine is set to 20? I am seeing slow results
as well with iteration=5, # of elements 200,000,000. 







--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/K-means-faster-on-Mahout-then-on-Spark-tp3195p15168.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
For additional commands, e-mail: user-help@spark.apache.org

Re: K-means faster on Mahout then on Spark

Posted by Xiangrui Meng <me...@gmail.com>.

Please also check the load balance of the RDD on YARN. How many
partitions are you using? Does it match the number of CPU cores?
-Xiangrui

On Thu, Sep 25, 2014 at 12:28 PM, bhusted <br...@gmail.com> wrote:
> What is the size of your vector mine is set to 20? I am seeing slow results
> as well with iteration=5, # of elements 200,000,000.
>
>
>
>
>
>
>
> --
> View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/K-means-faster-on-Mahout-then-on-Spark-tp3195p15168.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
> For additional commands, e-mail: user-help@spark.apache.org
>

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
For additional commands, e-mail: user-help@spark.apache.org