You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Matt McCline (JIRA)" <ji...@apache.org> on 2016/02/19 14:01:18 UTC

[jira] [Updated] (HIVE-13084) Vectorization throws exception where there is case statement in group by

     [ https://issues.apache.org/jira/browse/HIVE-13084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt McCline updated HIVE-13084:
--------------------------------
    Attachment: vector_between_date.q

> Vectorization throws exception where there is case statement in group by
> ------------------------------------------------------------------------
>
>                 Key: HIVE-13084
>                 URL: https://issues.apache.org/jira/browse/HIVE-13084
>             Project: Hive
>          Issue Type: Bug
>          Components: Vectorization
>            Reporter: Rajesh Balamohan
>            Assignee: Matt McCline
>         Attachments: vector_between_date.q
>
>
> When there is case statement in group by, hive throws unable to vectorize exception.
> e.g query just to demonstrate the problem
> {noformat}
> explain select l_partkey, case when l_commitdate between '2015-06-30' AND '2015-07-06' THEN '2015-06-30' END as wk from lineitem_test_l_shipdate_ts group by l_partkey, case when l_commitdate between '2015-06-30' AND '2015-07-06' THEN '2015-06-30' END;
> org.apache.hadoop.hive.ql.metadata.HiveException: Could not vectorize expression: org.apache.hadoop.hive.ql.plan.ExprNodeGenericFuncDesc
> Vertex dependency in root stage
> Reducer 2 <- Map 1 (SIMPLE_EDGE)
> Stage-0
>   Fetch Operator
>     limit:-1
>     Stage-1
>       Reducer 2
>       File Output Operator [FS_7]
>         Group By Operator [GBY_5] (rows=888777234 width=108)
>           Output:["_col0","_col1"],keys:KEY._col0, KEY._col1
>         <-Map 1 [SIMPLE_EDGE]
>           SHUFFLE [RS_4]
>             PartitionCols:_col0, _col1
>             Group By Operator [GBY_3] (rows=1777554469 width=108)
>               Output:["_col0","_col1"],keys:_col0, _col1
>               Select Operator [SEL_1] (rows=1777554469 width=108)
>                 Output:["_col0","_col1"]
>                 TableScan [TS_0] (rows=1777554469 width=108)
>                   rajesh@lineitem_test_l_shipdate_ts,lineitem_test_l_shipdate_ts,Tbl:COMPLETE,Col:NONE,Output:["l_partkey","l_commitdate"]
> {noformat}
> \cc [~mmccline], [~gopalv]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)