You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Joseph K. Bradley (JIRA)" <ji...@apache.org> on 2015/04/27 20:59:39 UTC

[jira] [Resolved] (SPARK-4736) functions returning the category with weights

     [ https://issues.apache.org/jira/browse/SPARK-4736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joseph K. Bradley resolved SPARK-4736.
--------------------------------------
    Resolution: Duplicate

I'm closing this issue since it's a duplicate of the other, and since there has been more recent activity on the other JIRA.  If you're interested in continuing work, please coordinate on the other JIRA.  Thanks!  We're going to provide this functionality via the spark.ml API for trees.

> functions returning the category with weights
> ---------------------------------------------
>
>                 Key: SPARK-4736
>                 URL: https://issues.apache.org/jira/browse/SPARK-4736
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>            Reporter: yu jiang
>
> In this version, we add two functions: 1) predictByVotingWithWeight(features: Vector) and 2) predictWithWeight(features: Vector). And we also modify the function: predictByVoting(features: Vector). There are at least two reasons why we make such improvement: 1) In our practice, we want to find the top N samples from one category. However in 1.3.0 version, the function of predict can only give the predicted category but without weights. 2) What's more, in our practice, the numbers of positive and negative samples are very unbalance. There are much less positive samples than negative samples. According to the results of votes, there are very few samples predicted as positive sample. If the weights are also given, users can make a proper threshold to modify the results so that the performance can be improved.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org