You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2015/08/13 07:28:45 UTC

[jira] [Updated] (SPARK-1655) In naive Bayes, store conditional probabilities distributively.

     [ https://issues.apache.org/jira/browse/SPARK-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Xiangrui Meng updated SPARK-1655:
---------------------------------
    Target Version/s: 1.6.0  (was: 1.5.0)

> In naive Bayes, store conditional probabilities distributively.
> ---------------------------------------------------------------
>
>                 Key: SPARK-1655
>                 URL: https://issues.apache.org/jira/browse/SPARK-1655
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>            Reporter: Xiangrui Meng
>            Assignee: Aaron Staple
>
> In the current implementation, we collect all conditional probabilities to the driver node. When there are many labels and many features, this puts heavy load on the driver. For scalability, we should provide a way to store conditional probabilities distributively.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org