You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/12/25 04:38:00 UTC

[jira] [Commented] (DRILL-8376) Add Distribution UDFs

    [ https://issues.apache.org/jira/browse/DRILL-8376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17651851#comment-17651851 ] 

ASF GitHub Bot commented on DRILL-8376:
---------------------------------------

cgivre opened a new pull request, #2729:
URL: https://github.com/apache/drill/pull/2729

   # [DRILL-8376](https://issues.apache.org/jira/browse/DRILL-8376): Add Distribution UDFs
   
   ## Description
   This PR adds several new UDFs to help with statistical analysis.  They are `width_bucket` which mirrors the functionality of the POSTGRES function of the same name. (https://www.oreilly.com/library/view/sql-in-a/9780596155322/re91.html).  This function is useful for building histograms of data.
   
   This also adds the `kendall_correlation` and `pearson_correlation` functions which are two function for calculating correlation coefficients of two columns.
   
   ## Documentation
   Updated README.
   
   ## Testing
   Added unit tests.




> Add Distribution UDFs
> ---------------------
>
>                 Key: DRILL-8376
>                 URL: https://issues.apache.org/jira/browse/DRILL-8376
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Functions - Drill
>    Affects Versions: 1.21
>            Reporter: Charles Givre
>            Assignee: Charles Givre
>            Priority: Minor
>
> Add `width_bucket`, `pearson_correlation` and `kendall_correlation` to Drill



--
This message was sent by Atlassian Jira
(v8.20.10#820010)