You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Gabor Szadovszky (JIRA)" <ji...@apache.org> on 2018/10/11 08:33:00 UTC
[jira] [Updated] (PARQUET-1365) Don't write page level statistics
for v1
[ https://issues.apache.org/jira/browse/PARQUET-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gabor Szadovszky updated PARQUET-1365:
--------------------------------------
Fix Version/s: 1.11.0
> Don't write page level statistics for v1
> ----------------------------------------
>
> Key: PARQUET-1365
> URL: https://issues.apache.org/jira/browse/PARQUET-1365
> Project: Parquet
> Issue Type: Improvement
> Components: parquet-mr
> Reporter: Gabor Szadovszky
> Assignee: Gabor Szadovszky
> Priority: Major
> Fix For: 1.11.0
>
>
> Page level statistics are never used in production and after adding column indexes they are completely useless. In page_v2 we are not able to remove the unnecessary statistics as they are required fields in the page header but in page_v1 it is optional. Let's get rid of the unnecessary and optional fields in the page headers.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)