Posted to issues@hive.apache.org by "Gopal V (JIRA)" <ji...@apache.org> on 2019/08/16 05:43:00 UTC

[jira] [Commented] (HIVE-22121) Turning on hive.tez.bucket.pruning produce wrong results

    [ https://issues.apache.org/jira/browse/HIVE-22121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16908732#comment-16908732 ] 

Gopal V commented on HIVE-22121:
--------------------------------


{code}
'            Map Operator Tree:'
'                TableScan'
'                  alias: test_table'
'                  filterExpr: (col_1 <> 2) (type: boolean)'
'                  buckets included: [] of 4'
{code}

The above is from the explain extended output; the SARG generated is

{code}
leaf-0 = (EQUALS col_1 2), expr = (not leaf-0)
{code}

So the SARG leaf list only has

{code}
[(EQUALS col_1 2)]
{code}

and the expression tree handling does not bail out for NOT.
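
To illustrate why ignoring the NOT is unsound, here is a minimal Python sketch. It is not Hive's code path: the `Equals`/`Not` classes, `bucket_of` (plain modulo, not Hive's real bucket hash) and both pruning functions are hypothetical names made up for this example. It only shows that pruning from the leaf `(EQUALS col_1 2)` alone drops every bucket that can hold rows matching `col_1 <> 2`, while bailing out on NOT keeps them.

```python
from dataclasses import dataclass
from typing import Optional, Set, Union

NUM_BUCKETS = 4


def bucket_of(value: int) -> int:
    # Hypothetical bucketing function (assumption); Hive's real hash differs.
    return value % NUM_BUCKETS


@dataclass(frozen=True)
class Equals:
    column: str
    literal: int


@dataclass(frozen=True)
class Not:
    child: "Expr"


Expr = Union[Equals, Not]


def buckets_from_leaves_only(expr: Expr) -> Set[int]:
    # Unsound: strips any NOT and prunes as if the filter were the bare leaf.
    while isinstance(expr, Not):
        expr = expr.child
    return {bucket_of(expr.literal)}


def buckets_with_bail_out(expr: Expr) -> Optional[Set[int]]:
    # Sound: returns None ("do not prune") as soon as a NOT is seen.
    if isinstance(expr, Not):
        return None
    return {bucket_of(expr.literal)}


def scan(rows, included):
    # Apply the real filter (col_1 <> 2) to rows in the included buckets only.
    return [v for v in rows
            if v != 2 and (included is None or bucket_of(v) in included)]


rows = [1, 2, 3, 4]                  # col_1 values from the reproducer
predicate = Not(Equals("col_1", 2))  # col_1 <> 2

wrong = scan(rows, buckets_from_leaves_only(predicate))
right = scan(rows, buckets_with_bail_out(predicate))
print(wrong, right)
```

In this sketch the leaf-only variant reads just the bucket that would hold `col_1 = 2`, so no row survives the filter, whereas the bail-out variant scans all four buckets.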

> Turning on hive.tez.bucket.pruning produce wrong results
> --------------------------------------------------------
>
>                 Key: HIVE-22121
>                 URL: https://issues.apache.org/jira/browse/HIVE-22121
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 3.1.0, 4.0.0
>            Reporter: Vineet Garg
>            Assignee: Vineet Garg
>            Priority: Major
>
> *Reproducer*
> {code:sql}
> set hive.query.results.cache.enabled=false;
> set hive.optimize.ppd.storage=true;
> set hive.optimize.index.filter=true;
> set hive.tez.bucket.pruning=true; 
> CREATE TABLE `test_table`(                 
>    `col_1` int,                                     
>    `col_2` string,                                  
>    `col_3` string)                                  
>  CLUSTERED BY (                                     
>    col_1)                                           
>  INTO 4 BUCKETS;                                     
> insert into test_table values(1, 'one', 'ONE'), (2, 'two', 'TWO'), (3,'three','THREE'),(4,'four','FOUR');
> select * from test_table;
> explain select col_1, col_2, col_3 from test_table where col_1 <> 2 order by col_2;
> select col_1, col_2, col_3 from test_table where col_1 <> 2 order by col_2;
> {code}
> The above SQL query produces zero rows.


