You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Jie Li (Commented) (JIRA)" <ji...@apache.org> on 2011/12/11 19:30:40 UTC

[jira] [Commented] (HIVE-600) Running TPC-H queries on Hive

    [ https://issues.apache.org/jira/browse/HIVE-600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13167164#comment-13167164 ] 

Jie Li commented on HIVE-600:
-----------------------------

Hi all, we conducted TPC-H benchmark on Pig as well and compared with Hive. Overall Hive is very efficient, but we find some of Hive's queries are suboptimal, especially for the order of joins, e.g. it's better to do small joins first. That's probably why some of Hive's queries were either super slow or failed (e.g. Q9 failed in our comparison, and was extremely slow in Hadapt's comparison).

Our results are available at https://issues.apache.org/jira/browse/PIG-2397. Hope they're helpful to Hive as well.
                
> Running TPC-H queries on Hive
> -----------------------------
>
>                 Key: HIVE-600
>                 URL: https://issues.apache.org/jira/browse/HIVE-600
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Yuntao Jia
>            Assignee: Yuntao Jia
>         Attachments: TPC-H_on_Hive_2009-08-11.pdf, TPC-H_on_Hive_2009-08-11.tar.gz, TPC-H_on_Hive_2009-08-14.tar.gz
>
>
> The goal is to run all TPC-H (http://www.tpc.org/tpch/) benchmark queries on Hive for two reasons. First, through those queries, we would like to find the new features that we need to put into Hive so that Hive supports common SQL queries. Second, we would like to measure the performance of Hive to find out what Hive is not good at. We can then improve Hive based on those information. 
> For queries that are not supported now in Hive, I will try to rewrite them to one or more Hive-supported queries. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira