You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Joseph K. Bradley (JIRA)" <ji...@apache.org> on 2017/02/10 20:04:42 UTC

[jira] [Commented] (SPARK-14523) Feature parity for Statistics ML with MLlib

    [ https://issues.apache.org/jira/browse/SPARK-14523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15861789#comment-15861789 ] 

Joseph K. Bradley commented on SPARK-14523:
-------------------------------------------

I'd like to keep this open until we have linked tasks for the missing functionality.

[~hujiayin] This is for parity w.r.t. the RDD-based API, not for adding new functionality to MLlib.  I think there's already a JIRA for ARIMA somewhere.

> Feature parity for Statistics ML with MLlib
> -------------------------------------------
>
>                 Key: SPARK-14523
>                 URL: https://issues.apache.org/jira/browse/SPARK-14523
>             Project: Spark
>          Issue Type: Sub-task
>          Components: ML
>            Reporter: yuhao yang
>
> Some statistics functions have been supported by DataFrame directly. Use this jira to discuss/design the statistics package in Spark.ML and its function scope. Hypothesis test and correlation computation may still need to expose independent interfaces.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org