You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Narine Kokhlikyan (JIRA)" <ji...@apache.org> on 2015/09/29 00:57:04 UTC

[jira] [Commented] (SPARK-10045) Add support for DataFrameStatFunctions in SparkR

    [ https://issues.apache.org/jira/browse/SPARK-10045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14934268#comment-14934268 ] 

Narine Kokhlikyan commented on SPARK-10045:
-------------------------------------------

Hi everyone,

how about standard deviation, variance, quantiles and other statistical functions?

We can get stddev from summary, but I'm not sure that it is the right way to go. I was thinking that there should be methods like stddev(), var() ... etc which we could explicitly call from DataFrame API.

Thanks,
Narine
 

> Add support for DataFrameStatFunctions in SparkR
> ------------------------------------------------
>
>                 Key: SPARK-10045
>                 URL: https://issues.apache.org/jira/browse/SPARK-10045
>             Project: Spark
>          Issue Type: New Feature
>          Components: SparkR
>            Reporter: Sun Rui
>
> The stat functions are defined in http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameStatFunctions.
> Currently only crosstab() is supported.
> Functions to be supported include:
> corr, cov, freqItems



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org