Posted to issues@spark.apache.org by "Venkata Vineel (JIRA)" <ji...@apache.org> on 2015/07/06 11:05:05 UTC

[jira] [Commented] (SPARK-6001) K-Means clusterer should return the assignments of input points to clusters

    [ https://issues.apache.org/jira/browse/SPARK-6001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14614720#comment-14614720 ] 

Venkata Vineel commented on SPARK-6001:
---------------------------------------

[~derrickburns] Could you please assign this to me?

> K-Means clusterer should return the assignments of input points to clusters
> ---------------------------------------------------------------------------
>
>                 Key: SPARK-6001
>                 URL: https://issues.apache.org/jira/browse/SPARK-6001
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>    Affects Versions: 1.2.1
>            Reporter: Derrick Burns
>            Priority: Minor
>
> The K-Means clusterer returns a KMeansModel that contains the cluster centers. I suggest that, when the assignments of the input data to the clusters are available, the K-Means clusterer also return them as an RDD. While the assignments can be computed from the KMeansModel, why not return them when they are already available and save the re-computation cost?
> The K-means implementation at https://github.com/derrickburns/generalized-kmeans-clustering returns the assignments when available.
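
To illustrate the request, here is a minimal, Spark-free sketch in plain Python of a Lloyd-style K-Means loop that hands back the assignments alongside the centers. It is not MLlib's implementation: the names kmeans_with_assignments, closest_center, and squared_distance are hypothetical, and the initialization (a seeded random sample of the input points) is an assumption made for brevity. The point is only that each iteration already computes the assignments, so a converged run can return them without an extra pass over the data.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def closest_center(point, centers):
    """Index of the center nearest to point (ties go to the lowest index)."""
    return min(range(len(centers)),
               key=lambda i: squared_distance(point, centers[i]))


def kmeans_with_assignments(points, k, max_iterations=20, seed=0):
    """Hypothetical sketch: return (centers, assignments).

    assignments is a list of cluster indices, one per input point, or None
    if the loop hit max_iterations before converging (in which case the last
    assignments would be stale relative to the final centers).
    """
    rng = random.Random(seed)
    # Assumed initialization: k distinct input points chosen at random.
    centers = [list(p) for p in rng.sample(points, k)]
    assignments = None
    converged = False
    for _ in range(max_iterations):
        new_assignments = [closest_center(p, centers) for p in points]
        if new_assignments == assignments:
            # Nothing moved, so these assignments match the current centers.
            converged = True
            break
        assignments = new_assignments
        # Move each center to the mean of the points assigned to it.
        for i in range(k):
            members = [p for p, a in zip(points, assignments) if a == i]
            if members:  # leave an empty cluster's center where it is
                centers[i] = [sum(c) / len(members) for c in zip(*members)]
    return centers, (assignments if converged else None)


points = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
centers, assignments = kmeans_with_assignments(points, k=2)
```

When the second return value is None, a caller would fall back to recomputing the assignments from the centers, which is what KMeansModel.predict does in MLlib today.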



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
