You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Joseph K. Bradley (JIRA)" <ji...@apache.org> on 2015/05/26 23:47:17 UTC

[jira] [Created] (SPARK-7881) KMeans API for spark.ml Pipelines

Joseph K. Bradley created SPARK-7881:
----------------------------------------

             Summary: KMeans API for spark.ml Pipelines
                 Key: SPARK-7881
                 URL: https://issues.apache.org/jira/browse/SPARK-7881
             Project: Spark
          Issue Type: New Feature
          Components: ML
            Reporter: Joseph K. Bradley


Create a K-Means API for the spark.ml Pipelines API.  This should wrap the existing KMeans implementation in spark.mllib.

This should be the first clustering method added to Pipelines, and it will be important to consider [SPARK-7610] and think about designing the clustering API.  We do not have to have abstractions from the beginning (and probably should not) but should think far enough ahead so we can add abstractions later on.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org