You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Perinkulam I Ganesh (JIRA)" <ji...@apache.org> on 2015/07/21 15:41:04 UTC

[jira] [Commented] (SPARK-3157) Avoid duplicated stats in DecisionTree extractLeftRightNodeAggregates

    [ https://issues.apache.org/jira/browse/SPARK-3157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14635117#comment-14635117 ] 

Perinkulam I Ganesh commented on SPARK-3157:
--------------------------------------------

This function has been removed under SPARK-3043. Is this JIRA still valid ?

thanks

> Avoid duplicated stats in DecisionTree extractLeftRightNodeAggregates
> ---------------------------------------------------------------------
>
>                 Key: SPARK-3157
>                 URL: https://issues.apache.org/jira/browse/SPARK-3157
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>            Reporter: Joseph K. Bradley
>            Priority: Minor
>
> Improvement: computation, memory usage
> For ordered features, extractLeftRightNodeAggregates() computes pairs of cumulative sums.  However, these sums are redundant since they are simply cumulative sums accumulating from the left and right ends, respectively.  Only compute one sum.
> For unordered features, the left and right aggregates are essentially the same data, copied from the original aggregates, but shifted by one index.  Avoid copying data.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org