You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "David Mollitor (Jira)" <ji...@apache.org> on 2020/12/23 16:34:00 UTC
[jira] [Created] (HIVE-24566) Add Parquet Stats Optimization
David Mollitor created HIVE-24566:
-------------------------------------
Summary: Add Parquet Stats Optimization
Key: HIVE-24566
URL: https://issues.apache.org/jira/browse/HIVE-24566
Project: Hive
Issue Type: Improvement
Reporter: David Mollitor
Parquet files store min/max/count data in foot metadata.
When a query is submitted to a Parquet table, and stats are not available, Hive should launch a single multi-threaded processor that simply reads the meta data of each Parquet file instead of walking through every single record in the table.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)