Posted to issues@flink.apache.org by "godfrey he (Jira)" <ji...@apache.org> on 2019/09/27 04:40:00 UTC
[jira] [Commented] (FLINK-11711) Add table and column stats
[ https://issues.apache.org/jira/browse/FLINK-11711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939114#comment-16939114 ]
godfrey he commented on FLINK-11711:
------------------------------------
I will close this issue and open a new one to support arbitrary types for the max/min values, as well as other statistic types such as histograms.
> Add table and column stats
> --------------------------
>
> Key: FLINK-11711
> URL: https://issues.apache.org/jira/browse/FLINK-11711
> Project: Flink
> Issue Type: New Feature
> Components: Table SQL / Planner
> Reporter: godfrey he
> Assignee: godfrey he
> Priority: Major
> Labels: pull-request-available
> Time Spent: 20m
> Remaining Estimate: 0h
>
> We define two structures to hold statistics:
> 1. TableStats: table-level statistics, containing 2 elements:
> rowCount: Long // the number of rows in the table
> colStats: Map[String, ColumnStats] // map each column to its ColumnStats
> 2. ColumnStats: column-level statistics, containing 6 elements:
> ndv: Long // number of distinct values
> nullCount: Long // number of null values
> avgLen: Double // average length of column values
> maxLen: Integer // max length of column values
> max: Any // max value of column values
> min: Any // min value of column values
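
The two structures described above could be sketched in Scala roughly as follows. This is only an illustration built from the field list in the description, not the classes that were actually merged; the `StatsExample` object, the `nullFraction` helper, and the sample `orders` table are hypothetical names added to show how the fields fit together.

```scala
// Hypothetical sketch of the structures in the issue description;
// not the actual Flink classes.
final case class ColumnStats(
    ndv: Long,         // number of distinct values
    nullCount: Long,   // number of null values
    avgLen: Double,    // average length of column values
    maxLen: Integer,   // max length of column values
    max: Any,          // max value of column values
    min: Any)          // min value of column values

final case class TableStats(
    rowCount: Long,                      // number of rows in the table
    colStats: Map[String, ColumnStats])  // column name -> its ColumnStats

object StatsExample {
  // Hypothetical helper: fraction of rows that are null in a column.
  // Returns None for an unknown column or an empty table.
  def nullFraction(stats: TableStats, column: String): Option[Double] =
    stats.colStats.get(column).collect {
      case c if stats.rowCount > 0 => c.nullCount.toDouble / stats.rowCount
    }

  // Made-up sample data, for illustration only.
  val orders: TableStats = TableStats(
    rowCount = 1000L,
    colStats = Map(
      "id"   -> ColumnStats(1000L, 0L, 8.0, 8, 1000L, 1L),
      "name" -> ColumnStats(950L, 50L, 12.5, 40, "zoe", "adam")))
}
```

Because `max` and `min` are typed as `Any`, a consumer such as a planner has to check the runtime type before comparing values, which is the kind of detail the follow-up issue mentioned in the comment would need to settle.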
--
This message was sent by Atlassian Jira
(v8.3.4#803005)