Posted to issues@hive.apache.org by "Matt McCline (JIRA)" <ji...@apache.org> on 2015/09/23 00:17:04 UTC

[jira] [Updated] (HIVE-11517) Vectorized auto_smb_mapjoin_14.q produces different results

     [ https://issues.apache.org/jira/browse/HIVE-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt McCline updated HIVE-11517:
--------------------------------
    Attachment: HIVE-11517.01.patch

> Vectorized auto_smb_mapjoin_14.q produces different results
> -----------------------------------------------------------
>
>                 Key: HIVE-11517
>                 URL: https://issues.apache.org/jira/browse/HIVE-11517
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Matt McCline
>            Assignee: Matt McCline
>            Priority: Critical
>         Attachments: HIVE-11517.01.patch
>
>
> Converted the auto_smb_mapjoin_14.q Q file to use ORC and turned on vectorization.
> The query:
> {code}
> select count(*) from (
>   select a.key as key, a.value as val1, b.value as val2 from tbl1 a join tbl2 b on a.key = b.key
> ) subq1
> {code}
> returns a count of 10 instead of the expected 22.
> The query:
> {code}
> select src1.key, src1.cnt1, src2.cnt1 from
> (
>   select key, count(*) as cnt1 from 
>   (
>     select a.key as key, a.value as val1, b.value as val2 from tbl1 a join tbl2 b on a.key = b.key
>   ) subq1 group by key
> ) src1
> join
> (
>   select key, count(*) as cnt1 from 
>   (
>     select a.key as key, a.value as val1, b.value as val2 from tbl1 a join tbl2 b on a.key = b.key
>   ) subq2 group by key
> ) src2
> on src1.key = src2.key
> {code}
> produces:
> {code}
> 0	3	3
> 2	1	1
> 4	1	1
> 5	3	3
> 8	1	1
> 9	1	1
> {code}
> instead of:
> {code}
> 0	9	9
> 2	1	1
> 4	1	1
> 5	9	9
> 8	1	1
> 9	1	1
> {code}
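
The expected numbers quoted above can be cross-checked outside Hive. The following is a minimal sketch, not the Q file itself: it uses Python's built-in sqlite3 module, and the table contents (the KEYS list and the val_N strings) are an assumption inferred from the expected output, since the report does not list the rows of tbl1 and tbl2. It assumes each table holds three rows for keys 0 and 5 and one row for keys 2, 4, 8 and 9.

```python
import sqlite3

# Hypothetical data: the report does not list the table contents. These keys
# are inferred from the expected output (9 = 3 x 3 matches for keys 0 and 5).
KEYS = [0, 0, 0, 2, 4, 5, 5, 5, 8, 9]


def build_tables(conn):
    """Create tbl1 and tbl2 with identical (assumed) rows."""
    for name in ("tbl1", "tbl2"):
        conn.execute(f"create table {name} (key int, value text)")
        conn.executemany(
            f"insert into {name} values (?, ?)",
            [(k, f"val_{k}") for k in KEYS],
        )


def join_count(conn):
    """First query from the report: total number of joined rows."""
    return conn.execute(
        "select count(*) from ("
        "  select a.key as key, a.value as val1, b.value as val2"
        "  from tbl1 a join tbl2 b on a.key = b.key"
        ") subq1"
    ).fetchone()[0]


def per_key_counts(conn):
    """Inner aggregation of the second query: joined rows per key."""
    return conn.execute(
        "select key, count(*) as cnt1 from ("
        "  select a.key as key, a.value as val1, b.value as val2"
        "  from tbl1 a join tbl2 b on a.key = b.key"
        ") subq1 group by key order by key"
    ).fetchall()


conn = sqlite3.connect(":memory:")
build_tables(conn)
total = join_count(conn)
counts = per_key_counts(conn)
print(total)
print(counts)
```

Under that assumed data, the per-key counts are 9, 1, 1, 9, 1, 1, which sum to the expected total of 22. The incorrect per-key counts in the report (3, 1, 1, 3, 1, 1) sum to 10, the incorrect total, so both symptoms are consistent with the same join rows going missing for the duplicated keys.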



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)