You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Manoj Govindassamy (Jira)" <ji...@apache.org> on 2022/01/25 02:54:00 UTC

[jira] [Created] (HUDI-3316) HoodieColumnRangeMetadata doesn't include all Parquet chunk statistics

Manoj Govindassamy created HUDI-3316:
----------------------------------------

             Summary: HoodieColumnRangeMetadata doesn't include all Parquet chunk statistics
                 Key: HUDI-3316
                 URL: https://issues.apache.org/jira/browse/HUDI-3316
             Project: Apache Hudi
          Issue Type: Bug
          Components: writer-core
            Reporter: Manoj Govindassamy
             Fix For: 0.11.0


HoodieColumnChunkMetadata includes the following stats about a parquet column
 * columnName;
 * minValue
 * maxValue
 * numNulls

 

Parquet's ColumnChunkMetaData do have more stats and we need to include them all in our index 
 * distinct
 * num values 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)