You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/03/03 07:09:00 UTC

[jira] [Updated] (FLINK-26346) Add statistics collecting to sst files

     [ https://issues.apache.org/jira/browse/FLINK-26346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated FLINK-26346:
-----------------------------------
    Labels: pull-request-available  (was: )

> Add statistics collecting to sst files
> --------------------------------------
>
>                 Key: FLINK-26346
>                 URL: https://issues.apache.org/jira/browse/FLINK-26346
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Table Store
>    Affects Versions: 0.1.0
>            Reporter: Caizhi Weng
>            Priority: Major
>              Labels: pull-request-available
>
> Currently field statistics are not collected in sst files. With statistics we can do filter and other operations with better performance.
> Some formats like orc already record statistics into file headers, so for these special formats we just need to read them directly from files. For others, however, we need to collect the statistics by hand.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)