You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2022/06/19 11:53:00 UTC

[jira] [Commented] (IMPALA-11205) Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()

    [ https://issues.apache.org/jira/browse/IMPALA-11205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17556062#comment-17556062 ] 

ASF subversion and git services commented on IMPALA-11205:
----------------------------------------------------------

Commit 256f37f17f62c21927e3aebee57d512758a42f81 in impala's branch refs/heads/master from pranav.lodha
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=256f37f17 ]

IMPALA-11205: Implement Statistical functions: CORR(), COVAR_SAMP()
 and COVAR_POP()

CORR() function takes two numeric type columns as arguments and returns
the Pearson's correlation coefficient between them.
COVAR_SAMP() function takes two numeric type columns and returns sample
 covariance between them.
COVAR_POP() function takes two numeric type columns and returns
 population covariance between them.
These UDAFs are tested with a few query tests written in aggregation.test.

Change-Id: I32ad627c953ba24d9cde2d5549bdd0d27a9c0d06
Reviewed-on: http://gerrit.cloudera.org:8080/18413
Reviewed-by: Impala Public Jenkins <im...@cloudera.com>
Tested-by: Impala Public Jenkins <im...@cloudera.com>


> Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()
> ----------------------------------------------------------------------
>
>                 Key: IMPALA-11205
>                 URL: https://issues.apache.org/jira/browse/IMPALA-11205
>             Project: IMPALA
>          Issue Type: Sub-task
>          Components: Backend
>            Reporter: Pranav Yogi Lodha
>            Assignee: Pranav Yogi Lodha
>            Priority: Major
>
> Implement 3 statistical functions:
>  * CORR()
>  * COVAR_SAMP()
>  * COVAR_POP()
> *CORR()* : A correlation coefficient function CORR() which calculates Pearson's correlation coefficient between two columns. It should return a DOUBLE value ranging from -1 to 1 describing the correlation between the two numeric columns.
> *COVAR_SAMP()* : It returns the sample covariance between two columns of numeric types.
> *COVAR_POP()* : It returns the population covariance between two columns of numeric types.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org