You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Rahul Challapalli (JIRA)" <ji...@apache.org> on 2015/06/26 02:34:04 UTC

[jira] [Created] (DRILL-3380) CTAS Auto Partitioning : We are not pruning when we use functions in the select list

Rahul Challapalli created DRILL-3380:
----------------------------------------

             Summary: CTAS Auto Partitioning : We are not pruning when we use functions in the select list
                 Key: DRILL-3380
                 URL: https://issues.apache.org/jira/browse/DRILL-3380
             Project: Apache Drill
          Issue Type: Bug
          Components: Query Planning & Optimization
            Reporter: Rahul Challapalli
            Assignee: Steven Phillips
            Priority: Critical


git.commit.id.abbrev=5a34d81

I used the below query to create a paritioned data set
{code}
create table `lineitem` partition by (l_moddate) as select l.*, l_shipdate - extract(day from l_shipdate) + 1 l_moddate from cp.`tpch/lineitem.parquet` l;
{code}

The plan for the below query only scans one file
{code}
explain plan for select * from `lineitem` where l_moddate = date '1994-07-01';
 00-00    Screen
00-01      Project(*=[$0])
00-02        Project(*=[$0])
00-03          Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/drill/testdata/ctas_auto_partition/tpch_single_partition/lineitem/0_0_31.parquet]], selectionRoot=/drill/testdata/ctas_auto_partition/tpch_single_partition/lineitem, numFiles=1, columns=[`*`]]])
{code}

However the below plan indicates a full table scan
{code}
explain plan for select count(*) from `tpch_single_partition/lineitem` where l_moddate = date '1994-07-01';
00-00    Screen
00-01      StreamAgg(group=[{}], EXPR$0=[COUNT()])
00-02        Project($f0=[0])
00-03          SelectionVectorRemover
00-04            Filter(condition=[=($0, 1994-07-01)])
00-05              Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/ctas_auto_partition/tpch_single_partition/lineitem]], selectionRoot=/drill/testdata/ctas_auto_partition/tpch_single_partition/lineitem, numFiles=1, columns=[`l_moddate`]]])
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)