You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2014/06/17 10:27:01 UTC
[jira] [Commented] (SPARK-2160) error of Decision tree algorithm
in Spark MLlib
[ https://issues.apache.org/jira/browse/SPARK-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14033550#comment-14033550 ]
Sean Owen commented on SPARK-2160:
----------------------------------
You already added this as https://issues.apache.org/jira/browse/SPARK-2152 right?
> error of Decision tree algorithm in Spark MLlib
> --------------------------------------------------
>
> Key: SPARK-2160
> URL: https://issues.apache.org/jira/browse/SPARK-2160
> Project: Spark
> Issue Type: Bug
> Components: MLlib
> Affects Versions: 1.0.0
> Reporter: caoli
> Labels: patch
> Fix For: 1.1.0
>
> Original Estimate: 4h
> Remaining Estimate: 4h
>
> the error of comput rightNodeAgg about Decision tree algorithm in Spark MLlib , in the function extractLeftRightNodeAggregates() ,when compute rightNodeAgg used bindata index is error. in the DecisionTree.scala file about Line980:
> rightNodeAgg(featureIndex)(2 * (numBins - 2 - splitIndex)) =
> binData(shift + (2 * (numBins - 2 - splitIndex))) +
> rightNodeAgg(featureIndex)(2 * (numBins - 1 - splitIndex))
> the binData(shift + (2 * (numBins - 2 - splitIndex))) index compute is error, so the result of rightNodeAgg include repeated data about "bins"
--
This message was sent by Atlassian JIRA
(v6.2#6252)