You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Deneche A. Hakim (JIRA)" <ji...@apache.org> on 2015/05/28 17:43:21 UTC

[jira] [Commented] (DRILL-3196) Disable multiple partition by clauses in the same sql query

    [ https://issues.apache.org/jira/browse/DRILL-3196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14563117#comment-14563117 ] 

Deneche A. Hakim commented on DRILL-3196:
-----------------------------------------

Actually looking at the plan it seems valid. I tried this modified query on a small test file and the results looks correct:
{noformat}
0: jdbc:drill:zk=local> select a2, b2, c2, sum(a2) over(partition by a2 order by a2) sum_a2, count(*) over(partition by a2,b2,c2) count_all from `window_test.json` order by 4,5;
+-----+-----+-----+---------+------------+
| a2  | b2  | c2  | sum_a2  | count_all  |
+-----+-----+-----+---------+------------+
| 0   | 0   | 0   | 0       | 1          |
| 0   | 1   | 0   | 0       | 1          |
| 0   | 0   | 1   | 0       | 2          |
| 0   | 0   | 1   | 0       | 2          |
| 1   | 0   | 1   | 2       | 1          |
| 1   | 2   | 2   | 2       | 1          |
| 3   | 1   | 2   | 3       | 1          |
| 2   | 2   | 3   | 4       | 1          |
| 2   | 2   | 4   | 4       | 1          |
+-----+-----+-----+---------+------------+
{noformat}

> Disable multiple partition by clauses in the same sql query
> -----------------------------------------------------------
>
>                 Key: DRILL-3196
>                 URL: https://issues.apache.org/jira/browse/DRILL-3196
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Query Planning & Optimization
>    Affects Versions: 1.0.0
>            Reporter: Victoria Markman
>            Assignee: Sean Hsuan-Yi Chu
>            Priority: Critical
>              Labels: window_function
>
> Currently these queries parse and execute, but plan does not look correct.
> {code}
> 0: jdbc:drill:schema=dfs> explain plan for  select sum(a2) over(partition by a2 order by a2), count(*) over(partition by a2,b2,c2)  from t2 order by 1,2; 
> +------+------+
> | text | json |
> +------+------+
> | 00-00    Screen
> 00-01      Project(EXPR$0=[$0], EXPR$1=[$1])
> 00-02        SelectionVectorRemover
> 00-03          Sort(sort0=[$0], sort1=[$1], dir0=[ASC], dir1=[ASC])
> 00-04            Project(EXPR$0=[CASE(>($3, 0), CAST($4):ANY, null)], EXPR$1=[$5])
> 00-05              Window(window#0=[window(partition {0, 1, 2} order by [] range between UNBOUNDED PRECEDING and UNBOUNDED FOLLOWING aggs [COUNT()])])
> 00-06                SelectionVectorRemover
> 00-07                  Sort(sort0=[$0], sort1=[$1], sort2=[$2], dir0=[ASC], dir1=[ASC], dir2=[ASC])
> 00-08                    Window(window#0=[window(partition {0} order by [0] range between UNBOUNDED PRECEDING and CURRENT ROW aggs [COUNT($0), $SUM0($0)])])
> 00-09                      SelectionVectorRemover
> 00-10                        Sort(sort0=[$0], sort1=[$0], dir0=[ASC], dir1=[ASC])
> 00-11                          Project(a2=[$1], b2=[$0], c2=[$2])
> 00-12                            Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/aggregation/t2]], selectionRoot=/drill/testdata/aggregation/t2, numFiles=1, columns=[`a2`, `b2`, `c2`]]])
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)