Posted to dev@drill.apache.org by "Aman Sinha (JIRA)" <ji...@apache.org> on 2014/08/29 19:09:54 UTC
[jira] [Created] (DRILL-1362) Count(nullable-column) is incorrectly pushed into group scan operator
Aman Sinha created DRILL-1362:
---------------------------------
Summary: Count(nullable-column) is incorrectly pushed into group scan operator
Key: DRILL-1362
URL: https://issues.apache.org/jira/browse/DRILL-1362
Project: Apache Drill
Issue Type: Bug
Components: Execution - Operators
Affects Versions: 0.4.0
Reporter: Aman Sinha
The following query on the TPC-DS table web_returns produces a wrong result: the aggregate count(wr_return_quantity) is pushed into the Parquet group scan operator even though wr_return_quantity is nullable and the Parquet metadata apparently has no statistics for the nullable column. The value returned, 71763, is the table's total row count rather than the number of non-null values.
0: jdbc:drill:zk=local> select count(wr_return_quantity) from web_returns;
+------------+
| EXPR$0 |
+------------+
| 71763 |
+------------+
0: jdbc:drill:zk=local> explain plan for select count(wr_return_quantity) from web_returns;
{code:sql}
+------------+------------+
| text | json |
+------------+------------+
| 00-00 Screen
00-01 Project(EXPR$0=[$0])
00-02 Scan(groupscan=[org.apache.drill.exec.store.pojo.PojoRecordReader@e4acaad])
{code}
For reference, here are the correct results:
tpcds=# select count(wr_return_quantity) from web_returns;
count
-------
68616
(1 row)
tpcds=# select count(*) from web_returns;
count
-------
71763
(1 row)
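The gap between the two reference counts (71763 - 68616 = 3147) is the number of rows where wr_return_quantity is null. A minimal Python sketch of the rule this report implies is below: count(column) can be answered from metadata only when the column is non-nullable or a null count is available. The function name and parameters are hypothetical and are not Drill's API; the null count of 3147 is derived from the figures above, not read from any Parquet file.

```python
from typing import Optional


def count_from_metadata(row_count: int,
                        nullable: bool,
                        null_count: Optional[int]) -> Optional[int]:
    """Return count(column) if metadata alone can answer it, else None.

    Hypothetical helper for illustration only; not part of Drill.
    None means the aggregate must not be pushed into the scan.
    """
    if not nullable:
        # Every row contributes, so count(column) equals count(*).
        return row_count
    if null_count is None:
        # Nullable column without null statistics: the answer is unknown
        # without reading the data, so fall back to a real aggregation.
        return None
    # Nullable column with statistics: subtract the nulls.
    return row_count - null_count


# Figures taken from the report: count(*) = 71763, count(column) = 68616.
ROW_COUNT = 71763
NULL_COUNT = ROW_COUNT - 68616  # 3147 null values, derived by subtraction

no_stats = count_from_metadata(ROW_COUNT, nullable=True, null_count=None)
with_stats = count_from_metadata(ROW_COUNT, nullable=True, null_count=NULL_COUNT)
not_nullable = count_from_metadata(ROW_COUNT, nullable=False, null_count=None)
```

Under these assumptions, the plan shown in the report corresponds to taking the non-nullable branch for a nullable column, which returns the row count instead of declining the pushdown.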