You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Victoria Markman (JIRA)" <ji...@apache.org> on 2015/02/03 04:12:34 UTC
[jira] [Created] (DRILL-2148) Wrong result with grouping on a
column of date type with streaming aggregation
Victoria Markman created DRILL-2148:
---------------------------------------
Summary: Wrong result with grouping on a column of date type with streaming aggregation
Key: DRILL-2148
URL: https://issues.apache.org/jira/browse/DRILL-2148
Project: Apache Drill
Issue Type: Bug
Components: Execution - Operators
Affects Versions: 0.8.0
Reporter: Victoria Markman
Assignee: Chris Westin
Priority: Critical
Disable hash aggregation and run query below:
{code}
alter system set `planner.enable_hashagg` = false;
select
c_date,
COUNT(*)
from t1
group by
c_date
order by
c_date;
{code}
You will get wrong result. Because NULLs are sorted in the middle ( see DRILL-2084 ) they are folded in one of the non related groups.
We might have the same problem with the merge join on date, time and timestamp columns.
Attached is a parquet file that was used in this query.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)