You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@drill.apache.org by "Rahul Challapalli (JIRA)" <ji...@apache.org> on 2015/11/11 20:39:10 UTC

[jira] [Updated] (DRILL-4072) Hive partition pruning not working with avro serde's

     [ https://issues.apache.org/jira/browse/DRILL-4072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rahul Challapalli updated DRILL-4072:
-------------------------------------
    Attachment: hive.ddl
                episodes.avro

> Hive partition pruning not working with avro serde's
> ----------------------------------------------------
>
>                 Key: DRILL-4072
>                 URL: https://issues.apache.org/jira/browse/DRILL-4072
>             Project: Apache Drill
>          Issue Type: Bug
>    Affects Versions: 1.3.0
>            Reporter: Rahul Challapalli
>         Attachments: episodes.avro, hive.ddl
>
>
> git.commit.id.abbrev=e78e286
> The below plan indicates that partition pruning is not happening
> {code}
> explain plan for select * from hive.episodes_partitioned where doctor > 4;
> +------+------+
> | text | json |
> +------+------+
> | 00-00    Screen
> 00-01      Project(title=[$0], air_date=[$1], doctor=[$2], doctor_pt=[$3])
> 00-02        Project(title=[$0], air_date=[$1], doctor=[$2], doctor_pt=[$3])
> 00-03          SelectionVectorRemover
> 00-04            Filter(condition=[>($2, 4)])
> 00-05              Scan(groupscan=[HiveScan [table=Table(dbName:default, tableName:episodes_partitioned), inputSplits=[maprfs:///user/hive/warehouse/episodes_partitioned/doctor_pt=1/000000_0:0+367, maprfs:///user/hive/warehouse/episodes_partitioned/doctor_pt=11/000000_0:0+393, maprfs:///user/hive/warehouse/episodes_partitioned/doctor_pt=2/000000_0:0+371, maprfs:///user/hive/warehouse/episodes_partitioned/doctor_pt=4/000000_0:0+368, maprfs:///user/hive/warehouse/episodes_partitioned/doctor_pt=5/000000_0:0+357, maprfs:///user/hive/warehouse/episodes_partitioned/doctor_pt=6/000000_0:0+370, maprfs:///user/hive/warehouse/episodes_partitioned/doctor_pt=9/000000_0:0+350], columns=[`*`], numPartitions=7, partitions= [Partition(values:[1]), Partition(values:[11]), Partition(values:[2]), Partition(values:[4]), Partition(values:[5]), Partition(values:[6]), Partition(values:[9])]]])
> {code}
> I attached the data file and the hql required. Let me know if anything else is needed



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)