Posted to dev@hive.apache.org by "katty he (Jira)" <ji...@apache.org> on 2021/09/25 09:00:09 UTC

[jira] [Created] (HIVE-25557) Hive 3.1.2 with Tez is slow to count data in Parquet format

katty he created HIVE-25557:
-------------------------------

             Summary: Hive 3.1.2 with Tez is slow to count data in Parquet format
                 Key: HIVE-25557
                 URL: https://issues.apache.org/jira/browse/HIVE-25557
             Project: Hive
          Issue Type: Improvement
         Environment: Hive 3.1.2, Tez 0.10.1
            Reporter: katty he


Recently I tested a query of the form select count(*) from table in Hive 3.1.2 with Tez, where the table is stored in Parquet format. Normally, when counting, the query engine can read the metadata instead of reading the full data. In my case, however, Tez cannot get the count from metadata alone and reads the data, so the query is slow: counting 2 billion rows takes about 500s, plus about 60s to initialize. Is that a problem?
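
For reference, a minimal sketch of how a metadata-only count is commonly set up in Hive. The table name my_parquet_table is hypothetical, and whether the count is actually served from statistics is an assumption that depends on the table type (for example managed versus external) and on the stored statistics being marked as accurate:

```sql
-- Hypothetical table name; substitute the real table.
-- Allow Hive to answer simple aggregates such as count(*) from metastore statistics.
SET hive.compute.query.using.stats=true;

-- Gather basic table statistics (including numRows) into the metastore.
ANALYZE TABLE my_parquet_table COMPUTE STATISTICS;

-- Check that numRows is populated in the table parameters.
DESCRIBE FORMATTED my_parquet_table;

-- Inspect the plan: if the statistics are usable, the count may be answered
-- without launching Tez tasks; otherwise the plan shows a full table scan.
EXPLAIN SELECT count(*) FROM my_parquet_table;
SELECT count(*) FROM my_parquet_table;
```

If the plan still shows a table scan after the statistics are gathered, comparing the EXPLAIN output with and without the setting may help narrow down where the time is spent.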



--
This message was sent by Atlassian Jira
(v8.3.4#803005)