You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Vikas Tarvecha (Jira)" <ji...@apache.org> on 2019/10/02 18:41:00 UTC

[jira] [Created] (HIVE-22283) Aggregate Function on partition column scans Full Table

Vikas Tarvecha created HIVE-22283:
-------------------------------------

             Summary: Aggregate Function on partition column scans Full Table
                 Key: HIVE-22283
                 URL: https://issues.apache.org/jira/browse/HIVE-22283
             Project: Hive
          Issue Type: Wish
            Reporter: Vikas Tarvecha


While running hive queries like "SELECT max(date_partition) FROM VERY_LARGE_FACT_TBL", where VERY_LARGE_FACT_TBL is partitioned by column 'date_partition', a full table scan is required. I have observed that if the table contains large amount of data, thousands of mappers are initiated. I wish hive could just interact with metastore to pull simple things like max, min or distinct of partition column.  I am using Hive version 2.3.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)