Posted to dev@pig.apache.org by "Lorand Bendig (JIRA)" <ji...@apache.org> on 2013/10/01 23:50:25 UTC
[jira] [Updated] (PIG-3445) Make Parquet format available out of the box in Pig
[ https://issues.apache.org/jira/browse/PIG-3445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lorand Bendig updated PIG-3445:
-------------------------------
Attachment: PIG-3445-2.patch
This patch attempts to address the wrapper approach.
Remarks:
- The Parquet jars are taken as compile-time dependencies.
- The wrapper loader/storer classes locate the Parquet jars on the
classpath and add them to tmpjars (using PigContext::addJar would probably
be better, but how can the PigContext be retrieved in a LoadFunc?)
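
For illustration only, here is a rough sketch of the tmpjars idea described above. It is not taken from PIG-3445-2.patch: the class and method names (TmpJarsSketch, findContainingJar, appendJar) are hypothetical, and it uses only the JDK, so it shows the lookup and the list handling but not the Hadoop Configuration call that would actually set "tmpjars".

```java
import java.net.URL;
import java.security.CodeSource;
import java.util.LinkedHashSet;
import java.util.Set;

/** Hypothetical helper; names are illustrative and not part of Pig or the patch. */
final class TmpJarsSketch {

    /** Returns the location (jar or directory) a class was loaded from, or null if unknown. */
    static String findContainingJar(Class<?> clazz) {
        CodeSource source = clazz.getProtectionDomain().getCodeSource();
        if (source == null) {
            // Typically the case for JDK classes loaded by the bootstrap loader.
            return null;
        }
        URL location = source.getLocation();
        return location == null ? null : location.toString();
    }

    /** Appends a jar to a comma-separated list (the format of "tmpjars"), skipping duplicates. */
    static String appendJar(String existing, String jar) {
        Set<String> jars = new LinkedHashSet<>();
        if (existing != null && !existing.isEmpty()) {
            for (String entry : existing.split(",")) {
                if (!entry.isEmpty()) {
                    jars.add(entry);
                }
            }
        }
        if (jar != null && !jar.isEmpty()) {
            jars.add(jar);
        }
        return String.join(",", jars);
    }
}
```

A wrapper would presumably call findContainingJar on a class that lives in the Parquet jar, merge the result into the job's current "tmpjars" value with appendJar, and write the merged value back to the job configuration. That last step is assumed here and left out of the sketch.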
> Make Parquet format available out of the box in Pig
> ---------------------------------------------------
>
> Key: PIG-3445
> URL: https://issues.apache.org/jira/browse/PIG-3445
> Project: Pig
> Issue Type: Improvement
> Reporter: Julien Le Dem
> Fix For: 0.12.0
>
> Attachments: PIG-3445-2.patch, PIG-3445.patch
>
>
> We would add the Parquet jar to the Pig packages to make it available out of the box to Pig users.
> On top of that we could add the parquet.pig package to the list of packages to search for UDFs. (Alternatively, the Parquet jar could contain classes named org.apache.pig.builtin.ParquetLoader and ParquetStorer.)
> This way users can use Parquet simply by typing:
> A = LOAD 'foo' USING ParquetLoader();
> STORE A INTO 'bar' USING ParquetStorer();