You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Vineet Garg (JIRA)" <ji...@apache.org> on 2016/08/01 03:15:20 UTC
[jira] [Created] (HIVE-14396) CBO: Calcite Operator To Hive
Operator (Calcite Return Path): TestCliDriver count.q failure
Vineet Garg created HIVE-14396:
----------------------------------
Summary: CBO: Calcite Operator To Hive Operator (Calcite Return Path): TestCliDriver count.q failure
Key: HIVE-14396
URL: https://issues.apache.org/jira/browse/HIVE-14396
Project: Hive
Issue Type: Sub-task
Components: CBO
Reporter: Vineet Garg
Assignee: Vineet Garg
Currently there are three different failures
Set hive.cbo.returnpath.hiveop=true for all cases.
1) First case is wrong result for following query
{code:title=failure 1 Wrong result}
explain select count(1), count(*), count(a), count(b), count(c), count(d), count(distinct a), count(distinct b), count(distinct c), count(distinct d), count(distinct a,b), count(distinct b,c), count(distinct c,d), count(distinct a,d), count(distinct a,c), count(distinct b,d), count(distinct a,b,c), count(distinct b,c,d), count(distinct a,c,d), count(distinct a,b,d), count(distinct a,b,c,d) from abcd;
{code}
This occurs due to a bug in HiveCalciteUtil.getExprNodes. While looking for corresponding expression for a aggregate function's argument wrong index is being used.
2) Out of bound exception for following
{code}
set hive.map.aggr=false
explain select count(1), count(*), count(a), count(b), count(c), count(d), count(distinct a), count(distinct b), count(distinct c), count(distinct d), count(distinct a,b), count(distinct b,c), count(distinct c,d), count(distinct a,d), count(distinct a,c), count(distinct b,d), count(distinct a,b,c), count(distinct b,c,d), count(distinct a,c,d), count(distinct a,b,d), count(distinct a,b,c,d) from abcd;
{code}
The above happens while converting Calcite Aggregation to Hive's group by operator.
3) Once the above case with exception is fixed same query with hive.map.aggr=false give wrong results. Problem in this case is that while creating expression for aggregate function's argument we end up with wrong column info from underlying reduce sink operator.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)