You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@hive.apache.org by "Guilherme Santiago Ribeiro Silva (JIRA)" <ji...@apache.org> on 2015/06/11 01:27:00 UTC

[jira] [Commented] (HIVE-600) Running TPC-H queries on Hive

    [ https://issues.apache.org/jira/browse/HIVE-600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14581226#comment-14581226 ] 

Guilherme Santiago Ribeiro Silva commented on HIVE-600:
-------------------------------------------------------

Hi Yuntao Jia, 
i'm doing a study with a Hadoop and Hive to tuning the throughput of querys and i'm using the HIVE600, thats i find here. 
But i don't understand why in querys have a DROP, CREATE and populated the relations and so the SELECTs. There are a reason to execute this scripts with this structure or i can execute only the SELECTs? 

> Running TPC-H queries on Hive
> -----------------------------
>
>                 Key: HIVE-600
>                 URL: https://issues.apache.org/jira/browse/HIVE-600
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Yuntao Jia
>            Assignee: Yuntao Jia
>         Attachments: TPC-H_on_Hive_2009-08-11.pdf, TPC-H_on_Hive_2009-08-11.tar.gz, TPC-H_on_Hive_2009-08-14.tar.gz
>
>
> The goal is to run all TPC-H (http://www.tpc.org/tpch/) benchmark queries on Hive for two reasons. First, through those queries, we would like to find the new features that we need to put into Hive so that Hive supports common SQL queries. Second, we would like to measure the performance of Hive to find out what Hive is not good at. We can then improve Hive based on those information. 
> For queries that are not supported now in Hive, I will try to rewrite them to one or more Hive-supported queries. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)