Posted to issues@drill.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/25 04:38:00 UTC
[jira] [Commented] (DRILL-8376) Add Distribution UDFs
[ https://issues.apache.org/jira/browse/DRILL-8376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17651851#comment-17651851 ]
ASF GitHub Bot commented on DRILL-8376:
---------------------------------------
cgivre opened a new pull request, #2729:
URL: https://github.com/apache/drill/pull/2729
# [DRILL-8376](https://issues.apache.org/jira/browse/DRILL-8376): Add Distribution UDFs
## Description
This PR adds several new UDFs to help with statistical analysis. The first is `width_bucket`, which mirrors the functionality of the PostgreSQL function of the same name (https://www.oreilly.com/library/view/sql-in-a/9780596155322/re91.html). This function is useful for building histograms of data.
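For readers unfamiliar with the function, here is a rough Python sketch of the PostgreSQL-style `width_bucket` semantics for an ascending range. It is not the code in this PR; the argument names (`value`, `low`, `high`, `buckets`) and the `histogram` helper are assumptions made for illustration, and the Drill UDF may treat edge cases differently.

```python
import math
from collections import Counter


def width_bucket(value, low, high, buckets):
    """Return the 1-based equal-width bucket that `value` falls into.

    Sketch of PostgreSQL-style behaviour for an ascending range [low, high):
    values below `low` map to 0, values at or above `high` map to buckets + 1.
    """
    if buckets <= 0:
        raise ValueError("buckets must be positive")
    if low >= high:
        raise ValueError("this sketch only handles low < high")
    if value < low:
        return 0
    if value >= high:
        return buckets + 1
    # Scale the offset from `low` into bucket units, then shift to 1-based.
    return math.floor((value - low) * buckets / (high - low)) + 1


def histogram(values, low, high, buckets):
    """Hypothetical helper: count how many values land in each bucket."""
    return Counter(width_bucket(v, low, high, buckets) for v in values)
```

Grouping rows by the bucket number and counting them, as `histogram` does here, is the usual way such a function is used to build a histogram.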
This PR also adds the `kendall_correlation` and `pearson_correlation` functions, which are two functions for calculating the correlation coefficient of two columns.
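As a reference for what these two coefficients measure, the following Python sketch computes both over two equal-length lists. It is illustrative only and is not taken from this PR: the Python function signatures are assumptions, the Kendall variant shown is tau-b (which adjusts for ties), and the Drill UDFs may use a different variant or algorithm.

```python
import math


def pearson_correlation(xs, ys):
    """Pearson's r: covariance divided by the product of standard deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return float("nan")  # undefined when either column is constant
    return cov / math.sqrt(var_x * var_y)


def kendall_correlation(xs, ys):
    """Kendall's tau-b, computed by comparing every pair of rows (O(n^2))."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    concordant = discordant = tied_x = tied_y = 0
    for i in range(n):
        for j in range(i + 1, n):
            dx = xs[i] - xs[j]
            dy = ys[i] - ys[j]
            if dx == 0:
                tied_x += 1
            if dy == 0:
                tied_y += 1
            if dx * dy > 0:
                concordant += 1  # both columns move in the same direction
            elif dx * dy < 0:
                discordant += 1  # the columns move in opposite directions
    total_pairs = n * (n - 1) // 2
    denom = math.sqrt((total_pairs - tied_x) * (total_pairs - tied_y))
    if denom == 0:
        return float("nan")  # undefined when a column has only tied values
    return (concordant - discordant) / denom
```

Pearson measures linear association on the raw values, while Kendall depends only on the relative ordering of the rows, so it is less sensitive to outliers.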
## Documentation
Updated README.
## Testing
Added unit tests.
> Add Distribution UDFs
> ---------------------
>
> Key: DRILL-8376
> URL: https://issues.apache.org/jira/browse/DRILL-8376
> Project: Apache Drill
> Issue Type: Improvement
> Components: Functions - Drill
> Affects Versions: 1.21
> Reporter: Charles Givre
> Assignee: Charles Givre
> Priority: Minor
>
> Add `width_bucket`, `pearson_correlation` and `kendall_correlation` to Drill
--
This message was sent by Atlassian Jira
(v8.20.10#820010)