Posted to issues@hive.apache.org by "Matt McCline (JIRA)" <ji...@apache.org> on 2015/09/22 09:43:04 UTC
[jira] [Commented] (HIVE-11517) Vectorized auto_smb_mapjoin_14.q produces different results
[ https://issues.apache.org/jira/browse/HIVE-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14902164#comment-14902164 ]
Matt McCline commented on HIVE-11517:
-------------------------------------
The problem goes away when VectorSMBMapJoinOperator.smbJoinComputeKeys makes a copy of the List<Object> returned by keyEvaluator.evaluate(keyValues[batchIndex]) instead of using the returned list directly.
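
One plausible reading of this observation is that the evaluator hands back the same list instance on every call, so a key held from an earlier row is silently overwritten by the next one. The sketch below illustrates that aliasing pattern and the defensive copy in isolation. It is not Hive code: KeyAliasingSketch, ReusingEvaluator, keepReference and keepCopy are hypothetical names, and the assumption that the evaluator reuses its output list is mine, not something this comment confirms.

```java
import java.util.ArrayList;
import java.util.List;

public class KeyAliasingSketch {

    /** Hypothetical evaluator that reuses one output list across calls (an assumption). */
    static final class ReusingEvaluator {
        private final List<Object> scratch = new ArrayList<>();

        List<Object> evaluate(Object[] row) {
            scratch.clear();
            for (Object column : row) {
                scratch.add(column);
            }
            return scratch; // the same instance is returned on every call
        }
    }

    /** Keeps the returned reference, so a later evaluate() call changes the held key. */
    static List<Object> keepReference(ReusingEvaluator evaluator, Object[] row) {
        return evaluator.evaluate(row);
    }

    /** Takes a snapshot, so a later evaluate() call cannot change the held key. */
    static List<Object> keepCopy(ReusingEvaluator evaluator, Object[] row) {
        return new ArrayList<>(evaluator.evaluate(row));
    }

    public static void main(String[] args) {
        ReusingEvaluator evaluator = new ReusingEvaluator();
        Object[] first = {0};
        Object[] second = {5};

        List<Object> aliased = keepReference(evaluator, first);
        evaluator.evaluate(second); // overwrites the list that 'aliased' points to
        System.out.println("aliased key after next row: " + aliased); // [5]

        List<Object> copied = keepCopy(evaluator, first);
        evaluator.evaluate(second); // the snapshot is unaffected
        System.out.println("copied key after next row:  " + copied); // [0]
    }
}
```

If the operator compares a held key against later rows, an aliased key would always appear equal to the current row's key, which could cause join groups to be cut short. Whether that is the actual mechanism here would need to be confirmed against the operator's code.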
> Vectorized auto_smb_mapjoin_14.q produces different results
> -----------------------------------------------------------
>
> Key: HIVE-11517
> URL: https://issues.apache.org/jira/browse/HIVE-11517
> Project: Hive
> Issue Type: Bug
> Reporter: Matt McCline
> Assignee: Matt McCline
> Priority: Critical
>
> Converted Q file to use ORC and turned on vectorization.
> The query:
> {code}
> select count(*) from (
>   select a.key as key, a.value as val1, b.value as val2 from tbl1 a join tbl2 b on a.key = b.key
> ) subq1
> {code}
> produces 10 instead of 22.
> The query:
> {code}
> select src1.key, src1.cnt1, src2.cnt1 from
> (
>   select key, count(*) as cnt1 from
>   (
>     select a.key as key, a.value as val1, b.value as val2 from tbl1 a join tbl2 b on a.key = b.key
>   ) subq1 group by key
> ) src1
> join
> (
>   select key, count(*) as cnt1 from
>   (
>     select a.key as key, a.value as val1, b.value as val2 from tbl1 a join tbl2 b on a.key = b.key
>   ) subq2 group by key
> ) src2
> on src1.key = src2.key
> {code}
> produces:
> {code}
> 0 3 3
> 2 1 1
> 4 1 1
> 5 3 3
> 8 1 1
> 9 1 1
> {code}
> instead of:
> {code}
> 0 9 9
> 2 1 1
> 4 1 1
> 5 9 9
> 8 1 1
> 9 1 1
> {code}