You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Rahul Challapalli (JIRA)" <ji...@apache.org> on 2015/06/27 02:21:05 UTC
[jira] [Created] (DRILL-3410) Partition Pruning : We are doing a
prune when we shouldn't
Rahul Challapalli created DRILL-3410:
----------------------------------------
Summary: Partition Pruning : We are doing a prune when we shouldn't
Key: DRILL-3410
URL: https://issues.apache.org/jira/browse/DRILL-3410
Project: Apache Drill
Issue Type: Bug
Components: Query Planning & Optimization
Reporter: Rahul Challapalli
Assignee: Jinfeng Ni
Priority: Critical
Fix For: 1.1.0
git.commit.id.abbrev=60bc945
The below plan does not look right. It should scan all the files based on the filters in the query. Also hive returned more rows than drill
{code}
explain plan for select * from `existing_partition_pruning/lineitempart` where (dir0=1993 and columns[0] >29600) or (dir0=1994 or columns[0]>29700);
| 00-00 Screen
00-01 Project(*=[$0])
00-02 Project(T70¦¦*=[$0])
00-03 SelectionVectorRemover
00-04 Filter(condition=[OR(AND(=($1, 1993), >(ITEM($2, 0), 29600)), =($1, 1994), >(ITEM($2, 0), 29700))])
00-05 Project(T70¦¦*=[$0], dir0=[$1], columns=[$2])
00-06 Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/drill/testdata/ctas_auto_partition/existing_partition_pruning/lineitempart/0_0_3.parquet], ReadEntryWithPath [path=/drill/testdata/ctas_auto_partition/existing_partition_pruning/lineitempart/0_0_4.parquet]], selectionRoot=/drill/testdata/ctas_auto_partition/existing_partition_pruning/lineitempart, numFiles=2, columns=[`*`]]])
|
{code}
I attached the data set used. Let me know if you need anything more
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)