You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "zhengruifeng (Jira)" <ji...@apache.org> on 2019/11/20 01:35:00 UTC

[jira] [Created] (SPARK-29967) KMeans support instance weighting

zhengruifeng created SPARK-29967:
------------------------------------

             Summary: KMeans support instance weighting
                 Key: SPARK-29967
                 URL: https://issues.apache.org/jira/browse/SPARK-29967
             Project: Spark
          Issue Type: Improvement
          Components: ML, PySpark
    Affects Versions: 3.0.0
            Reporter: zhengruifeng


Since https://issues.apache.org/jira/browse/SPARK-9610, we start to support instance weighting in ML.

However, Clustering and other impl in features still do not support instance weighting.

I think we need to start support weighting in KMeans, like what scikit-learn does.

It will contains three parts:

1, move the impl from .mllib to .ml

2, make .mllib.KMeans as a wrapper of .ml.KMeans

3, support instance weighting in the .ml.KMeans



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org