Posted to dev@hive.apache.org by "David Mollitor (Jira)" <ji...@apache.org> on 2020/12/23 16:34:00 UTC

[jira] [Created] (HIVE-24566) Add Parquet Stats Optimization

David Mollitor created HIVE-24566:
-------------------------------------

             Summary: Add Parquet Stats Optimization
                 Key: HIVE-24566
                 URL: https://issues.apache.org/jira/browse/HIVE-24566
             Project: Hive
          Issue Type: Improvement
            Reporter: David Mollitor


Parquet files store min/max/count data in their footer metadata.

When a query is submitted against a Parquet table and stats are not available, Hive should launch a single multi-threaded processor that reads only the footer metadata of each Parquet file instead of walking through every single record in the table.
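
As a rough illustration of the idea only (not a proposed patch), the aggregation could look something like the sketch below. It assumes a single numeric column, and every name in it (FooterStatsSketch, ColumnStats, collect, the footerReader function) is hypothetical. The footer read itself is stubbed out as a function argument; a real version would obtain the values from each file's footer rather than from an in-memory map.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

// Hypothetical sketch: combine per-file footer stats using a thread pool.
final class FooterStatsSketch {

    // Hypothetical summary of one column in one file (min, max, row count),
    // standing in for what a Parquet footer would provide.
    static final class ColumnStats {
        final long min;
        final long max;
        final long rowCount;

        ColumnStats(long min, long max, long rowCount) {
            this.min = min;
            this.max = max;
            this.rowCount = rowCount;
        }

        // Merging two summaries: smallest min, largest max, summed counts.
        ColumnStats merge(ColumnStats other) {
            return new ColumnStats(
                Math.min(min, other.min),
                Math.max(max, other.max),
                rowCount + other.rowCount);
        }
    }

    // Reads one summary per file on a fixed-size pool and merges the results.
    // footerReader is an assumed stand-in for the actual footer read.
    static ColumnStats collect(List<String> files,
                               Function<String, ColumnStats> footerReader,
                               int threads) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<ColumnStats>> futures = new ArrayList<>();
            for (String file : files) {
                Callable<ColumnStats> task = () -> footerReader.apply(file);
                futures.add(pool.submit(task));
            }
            // Identity element: merging it with any summary returns that summary.
            ColumnStats total = new ColumnStats(Long.MAX_VALUE, Long.MIN_VALUE, 0L);
            for (Future<ColumnStats> future : futures) {
                total = total.merge(future.get());
            }
            return total;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        // Made-up footer values for two made-up files, for demonstration only.
        Map<String, ColumnStats> fakeFooters = new HashMap<>();
        fakeFooters.put("part-0.parquet", new ColumnStats(3, 10, 100));
        fakeFooters.put("part-1.parquet", new ColumnStats(-5, 7, 50));

        ColumnStats total = collect(
            new ArrayList<>(fakeFooters.keySet()), fakeFooters::get, 2);
        System.out.println("min=" + total.min
            + " max=" + total.max + " rows=" + total.rowCount);
    }
}
```

The point of the sketch is that the work per file is constant (one footer read) rather than proportional to the number of records, and that the per-file results combine with a simple associative merge, so the reads can run in any order on the pool.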



--
This message was sent by Atlassian Jira
(v8.3.4#803005)