You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Jacques Nadeau (JIRA)" <ji...@apache.org> on 2016/03/09 07:53:40 UTC

[jira] [Updated] (DRILL-4473) Removing trivial projects reveals bugs in handling of nonexistent columns in StreamingAggregate

     [ https://issues.apache.org/jira/browse/DRILL-4473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jacques Nadeau updated DRILL-4473:
----------------------------------
    Assignee: Sean Hsuan-Yi Chu

> Removing trivial projects reveals bugs in handling of nonexistent columns in StreamingAggregate
> -----------------------------------------------------------------------------------------------
>
>                 Key: DRILL-4473
>                 URL: https://issues.apache.org/jira/browse/DRILL-4473
>             Project: Apache Drill
>          Issue Type: Bug
>            Reporter: Jacques Nadeau
>            Assignee: Sean Hsuan-Yi Chu
>
> We see a couple unit test failures in working with nonexistent columns once DRILL-4467 is fixed. This is because trivial projects no longer protect StreamingAggregate from non-existent columns. This is likely due to an incorrect check before throwing a Unsupported error. An unknown/ANY type should probably be allowed in the case of using sum/max/stddev
> {code:title=Plan before DRILL-4467}
> VOLCANO:Physical Planning (71ms):
> ScreenPrel: rowcount = 1.0, cumulative cost = {464.1 rows, 2375.1 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 185
>   ProjectPrel(col1=[$0], col2=[$1], col3=[$2], col4=[$3], col5=[$4]): rowcount = 1.0, cumulative cost = {464.0 rows, 2375.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 184
>     StreamAggPrel(group=[{}], col1=[SUM($0)], col2=[SUM($1)], col3=[SUM($2)], col4=[SUM($3)], col5=[SUM($4)]): rowcount = 1.0, cumulative cost = {464.0 rows, 2375.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 183
>       LimitPrel(offset=[0], fetch=[0]): rowcount = 1.0, cumulative cost = {463.0 rows, 2315.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 182
>         ProjectPrel(int_col=[$0], bigint_col=[$3], float4_col=[$4], float8_col=[$1], interval_year_col=[$2]): rowcount = 463.0, cumulative cost = {463.0 rows, 2315.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 181
>           ScanPrel(groupscan=[EasyGroupScan [selectionRoot=classpath:/employee.json, numFiles=1, columns=[`int_col`, `bigint_col`, `float4_col`, `float8_col`, `interval_year_col`], files=[classpath:/employee.json]]]): rowcount = 463.0, cumulative cost = {463.0 rows, 2315.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 160
> {code}
> {code:title=Plan after DRILL-4467}
> VOLCANO:Physical Planning (63ms):
> ScreenPrel: rowcount = 1.0, cumulative cost = {464.1 rows, 2375.1 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 151
>   ProjectPrel(col1=[$0], col2=[$1], col3=[$2], col4=[$3], col5=[$4]): rowcount = 1.0, cumulative cost = {464.0 rows, 2375.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 150
>     StreamAggPrel(group=[{}], col1=[SUM($0)], col2=[SUM($1)], col3=[SUM($2)], col4=[SUM($3)], col5=[SUM($4)]): rowcount = 1.0, cumulative cost = {464.0 rows, 2375.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 149
>       LimitPrel(offset=[0], fetch=[0]): rowcount = 1.0, cumulative cost = {463.0 rows, 2315.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 148
>         ScanPrel(groupscan=[EasyGroupScan [selectionRoot=classpath:/employee.json, numFiles=1, columns=[`int_col`, `bigint_col`, `float4_col`, `float8_col`, `interval_year_col`], files=[classpath:/employee.json]]]): rowcount = 463.0, cumulative cost = {463.0 rows, 2315.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 141
> Tests disabled referring to this bug in TestAggregateFunctions show multiple examples of this behavior.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)