You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Matt McCline (JIRA)" <ji...@apache.org> on 2015/09/22 09:43:04 UTC

[jira] [Commented] (HIVE-11517) Vectorized auto_smb_mapjoin_14.q produces different results

    [ https://issues.apache.org/jira/browse/HIVE-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14902164#comment-14902164 ] 

Matt McCline commented on HIVE-11517:
-------------------------------------

Problem goes away when a copy is made of the List<Object> in VectorSMBMapJoinOperator.smbJoinComputeKeys that comes from keyEvaluator.evaluate(keyValues[batchIndex]);

> Vectorized auto_smb_mapjoin_14.q produces different results
> -----------------------------------------------------------
>
>                 Key: HIVE-11517
>                 URL: https://issues.apache.org/jira/browse/HIVE-11517
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Matt McCline
>            Assignee: Matt McCline
>            Priority: Critical
>
> Converted Q file to use ORC and turned on vectorization.
> The query:
> {code}
> select count(*) from (
>   select a.key as key, a.value as val1, b.value as val2 from tbl1 a join tbl2 b on a.key = b.key
> ) subq1
> {code}
> produces 10 instead of 22.
> The query:
> {code}
> select src1.key, src1.cnt1, src2.cnt1 from
> (
>   select key, count(*) as cnt1 from 
>   (
>     select a.key as key, a.value as val1, b.value as val2 from tbl1 a join tbl2 b on a.key = b.key
>   ) subq1 group by key
> ) src1
> join
> (
>   select key, count(*) as cnt1 from 
>   (
>     select a.key as key, a.value as val1, b.value as val2 from tbl1 a join tbl2 b on a.key = b.key
>   ) subq2 group by key
> ) src2
> {code}
> produces:
> {code}
> 0	3	3
> 2	1	1
> 4	1	1
> 5	3	3
> 8	1	1
> 9	1	1
> {code}
> instead of:
> {code}
> 0	9	9
> 2	1	1
> 4	1	1
> 5	9	9
> 8	1	1
> 9	1	1
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)