You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Zheng Hu (Jira)" <ji...@apache.org> on 2022/05/17 02:26:00 UTC

[jira] [Created] (FLINK-27655) Implement Avro File statistic collector

Zheng Hu created FLINK-27655:
--------------------------------

             Summary: Implement Avro File statistic collector
                 Key: FLINK-27655
                 URL: https://issues.apache.org/jira/browse/FLINK-27655
             Project: Flink
          Issue Type: Sub-task
            Reporter: Zheng Hu


Currently, the flink table store's avro file writer don't provide its File statistic collector. So we have to use the generic FieldStatsCollector. 

In fact, the correct direction is:  Making all format writer has their own FileStatsCollector, so that we can just parse the columnar statistic from the file tailer, instead of comparing each column max-min when writing the records into the columnar file. 

In this way,  I think we can just remove the FileFormatImpl class and FieldStatsCollector class.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)