You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Anton Gozhiy (JIRA)" <ji...@apache.org> on 2018/03/01 09:59:00 UTC
[jira] [Created] (DRILL-6199) Filter push down doesn't work with
more than one nested subqueries
Anton Gozhiy created DRILL-6199:
-----------------------------------
Summary: Filter push down doesn't work with more than one nested subqueries
Key: DRILL-6199
URL: https://issues.apache.org/jira/browse/DRILL-6199
Project: Apache Drill
Issue Type: Bug
Affects Versions: 1.13.0
Reporter: Anton Gozhiy
Attachments: DRILL_6118_data_source.csv
*Data set:*
The data is generated used the attached file: *DRILL_6118_data_source.csv*
Data gen commands:
{code:sql}
create table dfs.tmp.`DRILL_6118_parquet_partitioned_by_folders/d1` (c1, c2, c3, c4, c5) as select cast(columns[0] as int) c1, columns[1] c2, columns[2] c3, columns[3] c4, columns[4] c5 from dfs.tmp.`DRILL_6118_data_source.csv` where columns[0] in (1, 3);
create table dfs.tmp.`DRILL_6118_parquet_partitioned_by_folders/d2` (c1, c2, c3, c4, c5) as select cast(columns[0] as int) c1, columns[1] c2, columns[2] c3, columns[3] c4, columns[4] c5 from dfs.tmp.`DRILL_6118_data_source.csv` where columns[0]=2;
create table dfs.tmp.`DRILL_6118_parquet_partitioned_by_folders/d3` (c1, c2, c3, c4, c5) as select cast(columns[0] as int) c1, columns[1] c2, columns[2] c3, columns[3] c4, columns[4] c5 from dfs.tmp.`DRILL_6118_data_source.csv` where columns[0]>3;
{code}
*Steps:*
# Execute the following query:
{code:sql}
explain plan for select * from (select * from (select * from dfs.tmp.`DRILL_6118_parquet_partitioned_by_folders`)) where c1<3
{code}
*Expected result:*
numFiles=2, numRowGroups=2, only files from the folders d1 and d2 should be scanned.
*Actual result:*
Filter push down doesn't work:
numFiles=3, numRowGroups=3, scanning from all files
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)