You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Sergey Shelukhin (JIRA)" <ji...@apache.org> on 2017/02/24 01:09:44 UTC
[jira] [Resolved] (HIVE-16017) MM tables - many queries duplicate
the data after master merge
[ https://issues.apache.org/jira/browse/HIVE-16017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sergey Shelukhin resolved HIVE-16017.
-------------------------------------
Resolution: Fixed
Stupid merge issue in Utilities...
> MM tables - many queries duplicate the data after master merge
> --------------------------------------------------------------
>
> Key: HIVE-16017
> URL: https://issues.apache.org/jira/browse/HIVE-16017
> Project: Hive
> Issue Type: Sub-task
> Reporter: Sergey Shelukhin
>
> Update: happens on many more queries it looks like, and started happening after a recent master merge after I wasn't working on the feature for a while
> This duplicates the data (given that the original query is a self-union, essentially outputs it 4 times instead of 2) for either MM or non-MM tables, on MM branch.
> It seems to be adding correct inputs (esp. in non-MM case the inputs are the same as before). Presumably something in the output changes in the branch is broken for this case. Not sure what yet.
> {noformat}
> CREATE TABLE tbl1_mm(key int, value string) CLUSTERED BY (key) SORTED BY (key) INTO 2 BUCKETS;
> insert overwrite table tbl1_mm select * from src where key < 10;
> select key, value from tbl1_mm a where key < 6
> union all
> select key, value from tbl1_mm a where key < 6;
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)