Posted to issues@hive.apache.org by "Eugene Koifman (JIRA)" <ji...@apache.org> on 2017/11/06 21:44:00 UTC

[jira] [Updated] (HIVE-17921) Aggregation with struct in LLAP produces wrong result

     [ https://issues.apache.org/jira/browse/HIVE-17921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Eugene Koifman updated HIVE-17921:
----------------------------------
    Description: 
Consider 
{noformat}
select ROW__ID, count(*) from over10k_orc_bucketed group by ROW__ID having count(*) > 1;
{noformat}
in acid_vectorization_original.q (available since HIVE-17458).

When run using TestMiniLlapCliDriver, it produces "NULL, N", where N varies from run to run.
The correct answer is an empty result set, as can be seen by running
{noformat}
select ROW__ID, * from over10k_orc_bucketed where ROW__ID is null
{noformat}
in the same test.

This happens with the following settings:
{noformat}
set hive.vectorized.execution.enabled=true;
set hive.vectorized.row.identifier.enabled=true;
{noformat}

The query fails with TestMiniLlapCliDriver but not with TestMiniTezCliDriver. See acid_vectorization_original_tez.q, which has an identical query.
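
For convenience, the pieces above can be put together into one repro sketch. It assumes over10k_orc_bucketed has already been created and loaded as in acid_vectorization_original.q; the statement order and the comments are my own arrangement, not a copy of the .q file.
{noformat}
-- Settings under which the wrong result was observed.
set hive.vectorized.execution.enabled=true;
set hive.vectorized.row.identifier.enabled=true;

-- Sanity check: expected to return no rows, i.e. no ROW__ID is NULL.
select ROW__ID, * from over10k_orc_bucketed where ROW__ID is null;

-- Expected: empty result set.
-- Observed with TestMiniLlapCliDriver: "NULL, N", with N varying from run to run.
select ROW__ID, count(*) from over10k_orc_bucketed group by ROW__ID having count(*) > 1;
{noformat}
Since the first query finds no NULL ROW__ID, a NULL group in the second query should not be possible.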

  was:
Consider 
{noformat}
select ROW__ID, count(*) from over10k_orc_bucketed group by ROW__ID having count(*) > 1;
{noformat}
 in acid_vectorization_original.q (available since HIVE-17458)

when run using TestMiniLlapCliDriver produces "NULL, N" where N varies from run to run.
The right answer is empty results set as can be seen by running
{noformat}
select ROW__ID, * from over10k_orc_bucketed where ROW__ID is null
{noformat}
in the same test.


> Aggregation with struct in LLAP produces wrong result
> -----------------------------------------------------
>
>                 Key: HIVE-17921
>                 URL: https://issues.apache.org/jira/browse/HIVE-17921
>             Project: Hive
>          Issue Type: Sub-task
>          Components: llap, Transactions
>    Affects Versions: 3.0.0
>            Reporter: Eugene Koifman
>
> Consider 
> {noformat}
> select ROW__ID, count(*) from over10k_orc_bucketed group by ROW__ID having count(*) > 1;
> {noformat}
> in acid_vectorization_original.q (available since HIVE-17458).
> When run using TestMiniLlapCliDriver, it produces "NULL, N", where N varies from run to run.
> The correct answer is an empty result set, as can be seen by running
> {noformat}
> select ROW__ID, * from over10k_orc_bucketed where ROW__ID is null
> {noformat}
> in the same test.
> This happens with the following settings:
> {noformat}
> set hive.vectorized.execution.enabled=true;
> set hive.vectorized.row.identifier.enabled=true;
> {noformat}
> The query fails with TestMiniLlapCliDriver but not with TestMiniTezCliDriver. See acid_vectorization_original_tez.q, which has an identical query.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)