Posted to dev@drill.apache.org by "Victoria Markman (JIRA)" <ji...@apache.org> on 2015/02/03 04:12:34 UTC

[jira] [Created] (DRILL-2148) Wrong result with grouping on a column of date type with streaming aggregation

Victoria Markman created DRILL-2148:
---------------------------------------

             Summary: Wrong result with grouping on a column of date type with streaming aggregation
                 Key: DRILL-2148
                 URL: https://issues.apache.org/jira/browse/DRILL-2148
             Project: Apache Drill
          Issue Type: Bug
          Components: Execution - Operators
    Affects Versions: 0.8.0
            Reporter: Victoria Markman
            Assignee: Chris Westin
            Priority: Critical


Disable hash aggregation and run the query below:
{code}

alter system set `planner.enable_hashagg` = false;

select
        c_date,
        COUNT(*)
from    t1
group by
        c_date
order by
        c_date;

{code}

You will get a wrong result. Because NULLs are sorted in the middle (see DRILL-2084), they are folded into one of the unrelated groups.
We might have the same problem with merge join on date, time, and timestamp columns.
Attached is the Parquet file that was used in this query.
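
To illustrate why NULLs sorted in the middle matter for streaming aggregation, here is a small Python toy model (not Drill code). It assumes, purely for illustration, that each value is a pair `(raw_days, is_null)` and that a NULL carries a placeholder raw value of 0; the report does not confirm this representation. All function names are hypothetical. A streaming aggregate only compares each row with the previous one, so if both the sort and the comparison ignore the null flag, NULL rows can end up merged with an adjacent non-NULL group.

```python
# Toy model, not Drill code. Each row is (raw_days, is_null).
# Assumption for illustration only: a NULL carries a placeholder raw value of 0.
rows = [(-5, False), (0, True), (0, False), (0, True), (7, False), (-5, False)]


def streaming_count(sorted_rows, same_group):
    """COUNT(*) per group, comparing each row only with the current group."""
    groups = []
    for row in sorted_rows:
        if groups and same_group(groups[-1][0], row):
            groups[-1][1] += 1          # row joins the current group
        else:
            groups.append([row, 1])     # row starts a new group
    return [(row, count) for row, count in groups]


def null_unaware_equal(a, b):
    # Hypothetical faulty comparison: looks at the raw value only.
    return a[0] == b[0]


def null_aware_equal(a, b):
    # NULL equals only NULL; non-NULL values compare by raw value.
    return a[1] == b[1] and (a[1] or a[0] == b[0])


# Sorting by raw value alone places the NULLs in the middle of the data.
buggy_input = sorted(rows, key=lambda r: r[0])
buggy = streaming_count(buggy_input, null_unaware_equal)

# Sorting NULLs last and comparing with the null flag keeps them separate.
fixed_input = sorted(rows, key=lambda r: (r[1], r[0]))
fixed = streaming_count(fixed_input, null_aware_equal)

print("null-unaware:", buggy)
print("null-aware:  ", fixed)
```

In this toy model the null-unaware run yields three groups, with the two NULL rows counted together with the row whose raw value is 0, while the null-aware run yields four groups with NULL as its own group. Whether Drill's streaming aggregate fails in exactly this way is not stated in this report.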



