You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "zhangjing (JIRA)" <ji...@apache.org> on 2017/01/19 07:25:26 UTC

[jira] [Updated] (FLINK-5566) Introduce structure to hold table and column level statistics

     [ https://issues.apache.org/jira/browse/FLINK-5566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

zhangjing updated FLINK-5566:
-----------------------------
    Description: 
We define two structure mode to hold statistics
1. TableStats: contain stats for table level, now only one element: rowCount
2. ColumnStats: contain stats of column level. 
for numeric column type: including ndv, nullCount, max, min, histogram
for string type: including ndv, nullCount, avgLen,maxLen
for boolean:including ndv, nullCount, trueCount, falseCount
for date/time/timestamp:  including ndv, nullCount, max, min, histogram 


> Introduce structure to hold table and column level statistics
> -------------------------------------------------------------
>
>                 Key: FLINK-5566
>                 URL: https://issues.apache.org/jira/browse/FLINK-5566
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Table API & SQL
>            Reporter: Kurt Young
>            Assignee: zhangjing
>
> We define two structure mode to hold statistics
> 1. TableStats: contain stats for table level, now only one element: rowCount
> 2. ColumnStats: contain stats of column level. 
> for numeric column type: including ndv, nullCount, max, min, histogram
> for string type: including ndv, nullCount, avgLen,maxLen
> for boolean:including ndv, nullCount, trueCount, falseCount
> for date/time/timestamp:  including ndv, nullCount, max, min, histogram 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)