Posted to github@arrow.apache.org by "mapleFU (via GitHub)" <gi...@apache.org> on 2023/06/08 05:58:19 UTC

[GitHub] [arrow] mapleFU commented on pull request #35989: GH-34351: [C++][Parquet] Statistic: tiny optimization and specification

mapleFU commented on PR #35989:
URL: https://github.com/apache/arrow/pull/35989#issuecomment-1581935377

   Some notes:
   1. The builder of `Statistics` always sets `has_null_count_` and ignores `has_distinct_count_` (the distinct count is not written when building `EncodedStatistics`). `has_min_max_` is `false` if the page contains only nulls, or only nulls and NaNs.
   2. The reader is able to set `distinct_count` and `has_distinct_count`, which is valid, but when it calls `Encoded`, that state is not included in `EncodedStatistics`.
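
To make the two notes concrete, here is a minimal toy model of the behavior described above. It is only a sketch: `ToyStatistics`, `ToyEncodedStatistics`, `Update`, `SetDistinctCount` and `Encode` are hypothetical names invented for illustration and are not the actual parquet-cpp classes or signatures.

```cpp
#include <cmath>
#include <cstdint>
#include <optional>
#include <vector>

// Hypothetical stand-in for the encoded form; NOT the real EncodedStatistics.
struct ToyEncodedStatistics {
  bool has_min_max = false;
  bool has_null_count = false;
  bool has_distinct_count = false;
  double min = 0.0;
  double max = 0.0;
  int64_t null_count = 0;
  int64_t distinct_count = 0;
};

// Hypothetical stand-in for Statistics over a double column.
class ToyStatistics {
 public:
  // Writer path: accumulate one page; std::nullopt models a null value.
  void Update(const std::vector<std::optional<double>>& page) {
    for (const auto& v : page) {
      if (!v.has_value()) {
        ++null_count_;
        continue;
      }
      if (std::isnan(*v)) continue;  // NaN never contributes to min/max
      if (!has_min_max_) {
        min_ = max_ = *v;
        has_min_max_ = true;
      } else {
        if (*v < min_) min_ = *v;
        if (*v > max_) max_ = *v;
      }
    }
  }

  // Reader path: a distinct count read from file metadata can be stored.
  void SetDistinctCount(int64_t n) {
    distinct_count_ = n;
    has_distinct_count_ = true;
  }

  bool HasDistinctCount() const { return has_distinct_count_; }

  // Encoding always carries the null count and never the distinct count.
  ToyEncodedStatistics Encode() const {
    ToyEncodedStatistics out;
    out.has_null_count = true;
    out.null_count = null_count_;
    if (has_min_max_) {
      out.has_min_max = true;
      out.min = min_;
      out.max = max_;
    }
    // distinct_count_ is intentionally not copied (note 2).
    return out;
  }

 private:
  bool has_min_max_ = false;
  bool has_distinct_count_ = false;
  double min_ = 0.0;
  double max_ = 0.0;
  int64_t null_count_ = 0;
  int64_t distinct_count_ = 0;
};
```

In this model, a page holding only nulls and NaNs encodes with `has_null_count` set and `has_min_max` unset, and a distinct count set on the reader side is visible on the object itself but absent from the encoded result.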
   

