You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Zheng Hu (Jira)" <ji...@apache.org> on 2022/05/17 02:26:00 UTC
[jira] [Created] (FLINK-27655) Implement Avro File statistic collector
Zheng Hu created FLINK-27655:
--------------------------------
Summary: Implement Avro File statistic collector
Key: FLINK-27655
URL: https://issues.apache.org/jira/browse/FLINK-27655
Project: Flink
Issue Type: Sub-task
Reporter: Zheng Hu
Currently, the flink table store's avro file writer don't provide its File statistic collector. So we have to use the generic FieldStatsCollector.
In fact, the correct direction is: Making all format writer has their own FileStatsCollector, so that we can just parse the columnar statistic from the file tailer, instead of comparing each column max-min when writing the records into the columnar file.
In this way, I think we can just remove the FileFormatImpl class and FieldStatsCollector class.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)