You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Andrew Musselman (JIRA)" <ji...@apache.org> on 2014/02/10 22:07:20 UTC
[jira] [Created] (PIG-3760) Partition pruning for ORC and Parquet
Andrew Musselman created PIG-3760:
-------------------------------------
Summary: Partition pruning for ORC and Parquet
Key: PIG-3760
URL: https://issues.apache.org/jira/browse/PIG-3760
Project: Pig
Issue Type: New Feature
Affects Versions: 0.12.0
Reporter: Andrew Musselman
Priority: Minor
Fix For: 0.13.0
>From the conversation on dev@pig:
"Partition pruning for ORC is not addressed in PIG-3558. We will need
to do partition pruning for both ORC and Parquet in a new ticket.
Curently there is no interface to deal with this kind of pushdown
(LoadMetadata.setPartitionFilter push the filter to loader, but remove
the filter statement, for ORC/Parquet, filter is a hint, and we need
to do the filter again in Pig even it is pushed to loader), we will
need to define a new interface for that. You are welcome to initiate
the work. I know Aniket is also interested in doing that, so be sure
the talk with him about this work.
Thanks,
Daniel
On Mon, Feb 10, 2014 at 11:42 AM, Andrew Musselman
<an...@gmail.com> wrote:
> I had a chat with a couple people last week about a feature request for
> Pig: in a "where" or "filter" clause, when loading an ORC file, to skip
> directly to the right offset instead of scanning the whole file."
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)