You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "katty he (Jira)" <ji...@apache.org> on 2021/09/25 09:00:09 UTC
[jira] [Created] (HIVE-25557) Hive 3.1.2 with Tez is slow to clount
data in parquet format
katty he created HIVE-25557:
-------------------------------
Summary: Hive 3.1.2 with Tez is slow to clount data in parquet format
Key: HIVE-25557
URL: https://issues.apache.org/jira/browse/HIVE-25557
Project: Hive
Issue Type: Improvement
Environment: Hive 3.1.2
Tez *0.10.1*
Reporter: katty he
recently, i use test a sql like seelct count(*) from table in Hive 3.1.2 with Tez, and the table is in parquet format, normally, when counting, the query engin can read metadata instead of reading the full data, but in my case, Tez can not get count by metadata only, it will read the data, so it's slow, when count 2 billion data, tez wil use 500s , and spend 60s to initialized, ts that a problem?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)