You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Abhishek Girish (JIRA)" <ji...@apache.org> on 2015/05/14 20:36:00 UTC

[jira] [Commented] (DRILL-2376) UNION ALL on Aggregates with GROUP BY returns incomplete results

    [ https://issues.apache.org/jira/browse/DRILL-2376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14544166#comment-14544166 ] 

Abhishek Girish commented on DRILL-2376:
----------------------------------------

Verified on Git.Commit.ID d10769f (May 12 build)

{code:sql}
> use dfs.tpcds_sf1_parquet;
+------------+------------+
|     ok     |  summary   |
+------------+------------+
| true       | Default schema changed to [dfs.tpcds_sf1_parquet] |
+------------+------------+
1 row selected (0.091 seconds)

select x
from
(SELECT Sum(ss_ext_sales_price) x
FROM  store_sales
UNION ALL
SELECT Sum(cs_ext_sales_price) x
FROM catalog_sales) tmp
GROUP BY x;
+------------+
|     x      |
+------------+
| 3.658019159349976E9 |
| 5.26520707451017E9 |
+------------+
2 rows selected (0.904 seconds)

> use dfs.tpcds_sf1_text_views;
+------------+------------+
|     ok     |  summary   |
+------------+------------+
| true       | Default schema changed to [dfs.tpcds_sf1_text_views] |
+------------+------------+
1 row selected (0.09 seconds)

select x
from
(SELECT Sum(ss_ext_sales_price) x
FROM  store_sales
UNION ALL
SELECT Sum(cs_ext_sales_price) x
FROM catalog_sales) tmp
GROUP BY x;
+------------+
|     x      |
+------------+
| 3.658019159349976E9 |
| 5.26520707451017E9 |
+------------+
2 rows selected (3.523 seconds)
{code}

The issue is now resolved. 

> UNION ALL on Aggregates with GROUP BY returns incomplete results
> ----------------------------------------------------------------
>
>                 Key: DRILL-2376
>                 URL: https://issues.apache.org/jira/browse/DRILL-2376
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Query Planning & Optimization
>    Affects Versions: 0.9.0
>            Reporter: Abhishek Girish
>            Assignee: Sean Hsuan-Yi Chu
>             Fix For: 1.0.0
>
>         Attachments: t1.parquet, t2.parquet
>
>
> The following query returns incomplete results:
> {code:sql}
> select x
> from
> (SELECT Sum(ss_ext_sales_price) x
> FROM  store_sales
> UNION ALL
> SELECT Sum(cs_ext_sales_price) x
> FROM catalog_sales) tmp
> GROUP BY x;
> Results from Drill:
> +------------+
> |     x      |
> +------------+
> | 3658019159.35 |
> +------------+
> 1 row selected (3.474 seconds)
> Results from Postgres:
>        x       
> ---------------
>  5265207074.51
>  3658019159.35
> (2 rows)
> {code}
> Removing GROUP BY returns the right results:
> {code:sql}
> select x
> from
> (SELECT Sum(ss_ext_sales_price) x
> FROM  store_sales
> UNION ALL
> SELECT Sum(cs_ext_sales_price) x
> FROM catalog_sales) tmp;
> Results from Drill:
> +------------+
> |     x      |
> +------------+
> | 5265207074.51 |
> | 3658019159.35 |
> +------------+
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)