You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Shuaishuai Nie (JIRA)" <ji...@apache.org> on 2013/08/08 00:44:49 UTC

[jira] [Created] (HIVE-5023) Hive get wrong result when partition has the same path but different schema or authority

Shuaishuai Nie created HIVE-5023:
------------------------------------

             Summary: Hive get wrong result when partition has the same path but different schema or authority
                 Key: HIVE-5023
                 URL: https://issues.apache.org/jira/browse/HIVE-5023
             Project: Hive
          Issue Type: Bug
            Reporter: Shuaishuai Nie


Hive does not differentiate scheme and authority in file uris which cause wrong result when partition has the same path but different schema or authority. Here is a simple repro

partition file path:
asv://container1@secondary1.blob.core.windows.net/2013-08-05/00/text1.txt
with content "2013-08-05 00:00:00"
asv://container2@secondary1.blob.core.windows.net/2013-08-05/00/text2.txt
with content "2013-08-05 00:00:20"

{noformat}
CREATE EXTERNAL TABLE IF NOT EXISTS T1 (t STRING) PARTITIONED BY (ProcessDate STRING, Hour STRING, ClusterName STRING) ROW FORMAT DELIMITED FIELDS TERMINATED by '\t' STORED AS TEXTFILE;
ALTER TABLE T1 DROP IF EXISTS PARTITION(processDate='2013-08-05', Hour='00', clusterName ='CLusterA');
ALTER TABLE T1 ADD IF NOT EXISTS PARTITION(processDate='2013-08-05', Hour='00', clusterName ='ClusterA') LOCATION 'asv://container1@secondary1.blob.core.windows.net/2013-08-05/00';
ALTER TABLE T1 DROP IF EXISTS PARTITION(processDate='2013-08-05', Hour='00', clusterName ='ClusterB');
ALTER TABLE T1 ADD IF NOT EXISTS PARTITION(processDate='2013-08-05', Hour='00', clusterName ='ClusterB') LOCATION 'asv://container2@secondary1.blob.core.windows.net/2013-08-05/00';
{noformat}

the expect output of the hive query
{noformat}
SELECT ClusterName, t FROM T1 WHERE ProcessDate=’2013-08-05’ AND Hour=’00’;
{noformat}
should be
{noformat}
ClusterA        2013-08-05 00:00:00
ClusterB        2013-08-05 00:00:20
{noformat}
However it is
{noformat}
ClusterA        2013-08-05 00:00:00
ClusterA        2013-08-05 00:00:20
{noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira