You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2022/06/19 11:53:00 UTC
[jira] [Commented] (IMPALA-11205) Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()
[ https://issues.apache.org/jira/browse/IMPALA-11205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17556062#comment-17556062 ]
ASF subversion and git services commented on IMPALA-11205:
----------------------------------------------------------
Commit 256f37f17f62c21927e3aebee57d512758a42f81 in impala's branch refs/heads/master from pranav.lodha
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=256f37f17 ]
IMPALA-11205: Implement Statistical functions: CORR(), COVAR_SAMP()
and COVAR_POP()
CORR() function takes two numeric type columns as arguments and returns
the Pearson's correlation coefficient between them.
COVAR_SAMP() function takes two numeric type columns and returns sample
covariance between them.
COVAR_POP() function takes two numeric type columns and returns
population covariance between them.
These UDAFs are tested with a few query tests written in aggregation.test.
Change-Id: I32ad627c953ba24d9cde2d5549bdd0d27a9c0d06
Reviewed-on: http://gerrit.cloudera.org:8080/18413
Reviewed-by: Impala Public Jenkins <im...@cloudera.com>
Tested-by: Impala Public Jenkins <im...@cloudera.com>
> Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()
> ----------------------------------------------------------------------
>
> Key: IMPALA-11205
> URL: https://issues.apache.org/jira/browse/IMPALA-11205
> Project: IMPALA
> Issue Type: Sub-task
> Components: Backend
> Reporter: Pranav Yogi Lodha
> Assignee: Pranav Yogi Lodha
> Priority: Major
>
> Implement 3 statistical functions:
> * CORR()
> * COVAR_SAMP()
> * COVAR_POP()
> *CORR()* : A correlation coefficient function CORR() which calculates Pearson's correlation coefficient between two columns. It should return a DOUBLE value ranging from -1 to 1 describing the correlation between the two numeric columns.
> *COVAR_SAMP()* : It returns the sample covariance between two columns of numeric types.
> *COVAR_POP()* : It returns the population covariance between two columns of numeric types.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org