You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "zhangjing (JIRA)" <ji...@apache.org> on 2017/01/19 07:25:26 UTC
[jira] [Updated] (FLINK-5566) Introduce structure to hold table and
column level statistics
[ https://issues.apache.org/jira/browse/FLINK-5566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
zhangjing updated FLINK-5566:
-----------------------------
Description:
We define two structure mode to hold statistics
1. TableStats: contain stats for table level, now only one element: rowCount
2. ColumnStats: contain stats of column level.
for numeric column type: including ndv, nullCount, max, min, histogram
for string type: including ndv, nullCount, avgLen,maxLen
for boolean:including ndv, nullCount, trueCount, falseCount
for date/time/timestamp: including ndv, nullCount, max, min, histogram
> Introduce structure to hold table and column level statistics
> -------------------------------------------------------------
>
> Key: FLINK-5566
> URL: https://issues.apache.org/jira/browse/FLINK-5566
> Project: Flink
> Issue Type: Sub-task
> Components: Table API & SQL
> Reporter: Kurt Young
> Assignee: zhangjing
>
> We define two structure mode to hold statistics
> 1. TableStats: contain stats for table level, now only one element: rowCount
> 2. ColumnStats: contain stats of column level.
> for numeric column type: including ndv, nullCount, max, min, histogram
> for string type: including ndv, nullCount, avgLen,maxLen
> for boolean:including ndv, nullCount, trueCount, falseCount
> for date/time/timestamp: including ndv, nullCount, max, min, histogram
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)