You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Jesus Camacho Rodriguez (JIRA)" <ji...@apache.org> on 2017/07/11 12:58:00 UTC

[jira] [Created] (HIVE-17073) Incorrect result with vectorization and SharedWorkOptimizer

Jesus Camacho Rodriguez created HIVE-17073:
----------------------------------------------

             Summary: Incorrect result with vectorization and SharedWorkOptimizer
                 Key: HIVE-17073
                 URL: https://issues.apache.org/jira/browse/HIVE-17073
             Project: Hive
          Issue Type: Bug
          Components: Vectorization
    Affects Versions: 3.0.0
            Reporter: Jesus Camacho Rodriguez
            Assignee: Jesus Camacho Rodriguez


We get incorrect result with vectorization and multi-output Select operator created by SharedWorkOptimizer. It can be reproduced in the following way.

{code:title=Correct}
select count(*) as h8_30_to_9
  from src
  join src1 on src.key = src1.key
  where src1.value = "val_278";
OK
2
{code}

{code:title=Correct}
select count(*) as h9_to_9_30
  from src
  join src1 on src.key = src1.key
  where src1.value = "val_255";
OK
2
{code}

{code:title=Incorrect}
select * from (
  select count(*) as h8_30_to_9
  from src
  join src1 on src.key = src1.key
  where src1.value = "val_278") s1
join (
  select count(*) as h9_to_9_30
  from src
  join src1 on src.key = src1.key
  where src1.value = "val_255") s2;
OK
2	0
{code}

Problem seems to be that some ds in the batch row need to be re-initialized after they have been forwarded to each output.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)