You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Jesus Camacho Rodriguez (JIRA)" <ji...@apache.org> on 2017/07/11 12:58:00 UTC
[jira] [Created] (HIVE-17073) Incorrect result with vectorization
and SharedWorkOptimizer
Jesus Camacho Rodriguez created HIVE-17073:
----------------------------------------------
Summary: Incorrect result with vectorization and SharedWorkOptimizer
Key: HIVE-17073
URL: https://issues.apache.org/jira/browse/HIVE-17073
Project: Hive
Issue Type: Bug
Components: Vectorization
Affects Versions: 3.0.0
Reporter: Jesus Camacho Rodriguez
Assignee: Jesus Camacho Rodriguez
We get incorrect result with vectorization and multi-output Select operator created by SharedWorkOptimizer. It can be reproduced in the following way.
{code:title=Correct}
select count(*) as h8_30_to_9
from src
join src1 on src.key = src1.key
where src1.value = "val_278";
OK
2
{code}
{code:title=Correct}
select count(*) as h9_to_9_30
from src
join src1 on src.key = src1.key
where src1.value = "val_255";
OK
2
{code}
{code:title=Incorrect}
select * from (
select count(*) as h8_30_to_9
from src
join src1 on src.key = src1.key
where src1.value = "val_278") s1
join (
select count(*) as h9_to_9_30
from src
join src1 on src.key = src1.key
where src1.value = "val_255") s2;
OK
2 0
{code}
Problem seems to be that some ds in the batch row need to be re-initialized after they have been forwarded to each output.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)