Posted to issues@drill.apache.org by "Deneche A. Hakim (JIRA)" <ji...@apache.org> on 2015/05/28 17:43:21 UTC
[jira] [Commented] (DRILL-3196) Disable multiple partition by clauses in the same sql query
[ https://issues.apache.org/jira/browse/DRILL-3196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14563117#comment-14563117 ]
Deneche A. Hakim commented on DRILL-3196:
-----------------------------------------
Actually, looking at the plan, it seems valid. I tried this modified query on a small test file and the results look correct:
{noformat}
0: jdbc:drill:zk=local> select a2, b2, c2, sum(a2) over(partition by a2 order by a2) sum_a2, count(*) over(partition by a2,b2,c2) count_all from `window_test.json` order by 4,5;
+-----+-----+-----+---------+------------+
| a2 | b2 | c2 | sum_a2 | count_all |
+-----+-----+-----+---------+------------+
| 0 | 0 | 0 | 0 | 1 |
| 0 | 1 | 0 | 0 | 1 |
| 0 | 0 | 1 | 0 | 2 |
| 0 | 0 | 1 | 0 | 2 |
| 1 | 0 | 1 | 2 | 1 |
| 1 | 2 | 2 | 2 | 1 |
| 3 | 1 | 2 | 3 | 1 |
| 2 | 2 | 3 | 4 | 1 |
| 2 | 2 | 4 | 4 | 1 |
+-----+-----+-----+---------+------------+
{noformat}
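As a cross-check, the two window expressions can be recomputed by hand. The sketch below is not Drill code. It assumes the input rows are exactly the (a2, b2, c2) triples visible in the result above, since the contents of `window_test.json` are not shown here. Because the first window orders by its own partition key, every row in a partition is a peer, so the default frame covers the whole partition and the running sum equals the partition total.

```python
from collections import Counter, defaultdict

# (a2, b2, c2) rows copied from the result table above. The real contents of
# window_test.json are not shown, so treating these as the input is an assumption.
rows = [
    (0, 0, 0), (0, 1, 0), (0, 0, 1), (0, 0, 1),
    (1, 0, 1), (1, 2, 2), (3, 1, 2), (2, 2, 3), (2, 2, 4),
]

# sum(a2) over(partition by a2 order by a2): all rows in a partition share the
# same a2, so they are peers and the sum covers the whole partition.
sum_by_a2 = defaultdict(int)
for a2, _, _ in rows:
    sum_by_a2[a2] += a2

# count(*) over(partition by a2,b2,c2): number of rows per distinct triple.
count_by_key = Counter(rows)

# order by 4,5: sort on sum_a2, then count_all (Python's sort is stable).
result = sorted(
    ((a2, b2, c2, sum_by_a2[a2], count_by_key[(a2, b2, c2)]) for a2, b2, c2 in rows),
    key=lambda r: (r[3], r[4]),
)

for r in result:
    print(r)
```

The sum_a2 and count_all values computed this way agree with the query output, which is consistent with the plan evaluating each window independently in its own Window operator.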
> Disable multiple partition by clauses in the same sql query
> -----------------------------------------------------------
>
> Key: DRILL-3196
> URL: https://issues.apache.org/jira/browse/DRILL-3196
> Project: Apache Drill
> Issue Type: Bug
> Components: Query Planning & Optimization
> Affects Versions: 1.0.0
> Reporter: Victoria Markman
> Assignee: Sean Hsuan-Yi Chu
> Priority: Critical
> Labels: window_function
>
> Currently these queries parse and execute, but the plan does not look correct.
> {code}
> 0: jdbc:drill:schema=dfs> explain plan for select sum(a2) over(partition by a2 order by a2), count(*) over(partition by a2,b2,c2) from t2 order by 1,2;
> +------+------+
> | text | json |
> +------+------+
> | 00-00 Screen
> 00-01 Project(EXPR$0=[$0], EXPR$1=[$1])
> 00-02 SelectionVectorRemover
> 00-03 Sort(sort0=[$0], sort1=[$1], dir0=[ASC], dir1=[ASC])
> 00-04 Project(EXPR$0=[CASE(>($3, 0), CAST($4):ANY, null)], EXPR$1=[$5])
> 00-05 Window(window#0=[window(partition {0, 1, 2} order by [] range between UNBOUNDED PRECEDING and UNBOUNDED FOLLOWING aggs [COUNT()])])
> 00-06 SelectionVectorRemover
> 00-07 Sort(sort0=[$0], sort1=[$1], sort2=[$2], dir0=[ASC], dir1=[ASC], dir2=[ASC])
> 00-08 Window(window#0=[window(partition {0} order by [0] range between UNBOUNDED PRECEDING and CURRENT ROW aggs [COUNT($0), $SUM0($0)])])
> 00-09 SelectionVectorRemover
> 00-10 Sort(sort0=[$0], sort1=[$0], dir0=[ASC], dir1=[ASC])
> 00-11 Project(a2=[$1], b2=[$0], c2=[$2])
> 00-12 Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/aggregation/t2]], selectionRoot=/drill/testdata/aggregation/t2, numFiles=1, columns=[`a2`, `b2`, `c2`]]])
> {code}