You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Shuaishuai Nie (JIRA)" <ji...@apache.org> on 2013/08/08 00:44:49 UTC
[jira] [Created] (HIVE-5023) Hive get wrong result when partition
has the same path but different schema or authority
Shuaishuai Nie created HIVE-5023:
------------------------------------
Summary: Hive get wrong result when partition has the same path but different schema or authority
Key: HIVE-5023
URL: https://issues.apache.org/jira/browse/HIVE-5023
Project: Hive
Issue Type: Bug
Reporter: Shuaishuai Nie
Hive does not differentiate scheme and authority in file uris which cause wrong result when partition has the same path but different schema or authority. Here is a simple repro
partition file path:
asv://container1@secondary1.blob.core.windows.net/2013-08-05/00/text1.txt
with content "2013-08-05 00:00:00"
asv://container2@secondary1.blob.core.windows.net/2013-08-05/00/text2.txt
with content "2013-08-05 00:00:20"
{noformat}
CREATE EXTERNAL TABLE IF NOT EXISTS T1 (t STRING) PARTITIONED BY (ProcessDate STRING, Hour STRING, ClusterName STRING) ROW FORMAT DELIMITED FIELDS TERMINATED by '\t' STORED AS TEXTFILE;
ALTER TABLE T1 DROP IF EXISTS PARTITION(processDate='2013-08-05', Hour='00', clusterName ='CLusterA');
ALTER TABLE T1 ADD IF NOT EXISTS PARTITION(processDate='2013-08-05', Hour='00', clusterName ='ClusterA') LOCATION 'asv://container1@secondary1.blob.core.windows.net/2013-08-05/00';
ALTER TABLE T1 DROP IF EXISTS PARTITION(processDate='2013-08-05', Hour='00', clusterName ='ClusterB');
ALTER TABLE T1 ADD IF NOT EXISTS PARTITION(processDate='2013-08-05', Hour='00', clusterName ='ClusterB') LOCATION 'asv://container2@secondary1.blob.core.windows.net/2013-08-05/00';
{noformat}
the expect output of the hive query
{noformat}
SELECT ClusterName, t FROM T1 WHERE ProcessDate=’2013-08-05’ AND Hour=’00’;
{noformat}
should be
{noformat}
ClusterA 2013-08-05 00:00:00
ClusterB 2013-08-05 00:00:20
{noformat}
However it is
{noformat}
ClusterA 2013-08-05 00:00:00
ClusterA 2013-08-05 00:00:20
{noformat}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira