You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Max Kaznady (JIRA)" <ji...@apache.org> on 2015/04/13 21:05:12 UTC

[jira] [Commented] (SPARK-6884) random forest predict probabilities functionality (like in sklearn)

    [ https://issues.apache.org/jira/browse/SPARK-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14492868#comment-14492868 ] 

Max Kaznady commented on SPARK-6884:
------------------------------------

Implemented a prototype, testing mapReduce code.

> random forest predict probabilities functionality (like in sklearn)
> -------------------------------------------------------------------
>
>                 Key: SPARK-6884
>                 URL: https://issues.apache.org/jira/browse/SPARK-6884
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>    Affects Versions: 1.4.0
>         Environment: cross-platform
>            Reporter: Max Kaznady
>              Labels: prediction, probability, randomforest, tree
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> Currently, there is no way to extract the class probabilities from the RandomForest classifier. I implemented a probability predictor by counting votes from individual trees and adding up their votes for "1" and then dividing by the total number of votes.
> I opened this ticked to keep track of changes. Will update once I push my code to master.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org