You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2019/03/22 01:40:00 UTC

[jira] [Work logged] (HIVE-21485) Hive desc operation takes more than 100 seconds after upgrading from Hive 1.2.1 to 2.3.4

     [ https://issues.apache.org/jira/browse/HIVE-21485?focusedWorklogId=217089&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-217089 ]

ASF GitHub Bot logged work on HIVE-21485:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 22/Mar/19 01:39
            Start Date: 22/Mar/19 01:39
    Worklog Time Spent: 10m 
      Work Description: TopGunViper commented on pull request #577: HIVE-21485: Add flag to turn off fetching partition stats in DESCRIBE…
URL: https://github.com/apache/hive/pull/577
 
 
   ## What changes were proposed in this pull request?
   Hive DESCRIBE [formatted|extended] operation cost more than 100 seconds after upgrading from Hive 1.2.1 to 2.3.4. This is mainly caused by showing stats for partitioned tables which was introduced by [HIVE-16098](https://issues.apache.org/jira/browse/HIVE-16098) when the partitioned tables have a large amount of partitions.
   
   So,could we add a flag that determines whether 'DESCRIBE [EXTENDED|FORMATTED]' operation display partitioned table stats or not?
   
   ## How was this patch tested?
   Query Unit Test
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

            Worklog Id:     (was: 217089)
            Time Spent: 10m
    Remaining Estimate: 0h

> Hive desc operation takes more than 100 seconds after upgrading from Hive 1.2.1 to 2.3.4
> ----------------------------------------------------------------------------------------
>
>                 Key: HIVE-21485
>                 URL: https://issues.apache.org/jira/browse/HIVE-21485
>             Project: Hive
>          Issue Type: Bug
>          Components: CLI, Hive
>    Affects Versions: 2.3.4
>            Reporter: Qingxin Wu
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Hive desc [formatted|extended] operation cost more than 100 seconds after upgrading from Hive 1.2.1 to 2.3.4. This is mainly caused by showing stats for partitioned tables which was introduced by HIVE-16098 when the partitioned tables have a large amount of partitions. In our case, the number of partition is 187221.
> {code:java}
> hive> desc bus.kafka_data;
> OK
> id            	        string
> ...
> d                  	    map<string,string>
> stat_date           	string
> log_id              	string
> # Partition Information
> # col_name            	data_type           	comment
> stat_date           	string
> log_id              	string
> Time taken: 115.342 seconds, Fetched: 42 row(s)
> {code}
> same operation executed in hive-1.2.1 and only cost 2 seconds.
> {code:java}
> hive> desc bus.kafka_data;
> OK
> id            	        string
> ...
> d                  	    map<string,string>
> stat_date           	string
> log_id              	string
> # Partition Information
> # col_name            	data_type           	comment
> stat_date           	string
> log_id              	string
> Time taken: 2.037 seconds, Fetched: 42 row(s)
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)