You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Dmitriy V. Ryaboy (JIRA)" <ji...@apache.org> on 2011/07/10 05:17:00 UTC

[jira] [Updated] (PIG-2143) Improvements for PigStorage

     [ https://issues.apache.org/jira/browse/PIG-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dmitriy V. Ryaboy updated PIG-2143:
-----------------------------------

    Attachment: PIG-2143.diff

Attached patch adds the code for treating an optional second parameter as a list of arguments.
It also moves PigStorageSchema code into PigStorage, controlled by a '-schema' flag, and deprecates PigStorageSchema (no plans to remove it completely yet).

I plan to add the compression and overwrite flags as well in this ticket, consider this an early draft.

> Improvements for PigStorage
> ---------------------------
>
>                 Key: PIG-2143
>                 URL: https://issues.apache.org/jira/browse/PIG-2143
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Dmitriy V. Ryaboy
>         Attachments: PIG-2143.diff
>
>
> I'd like to propose that we allow for a greater degree of customization in PigStorage.
> An incomplete list features that we might want to add:
> - flag to tell it to overwrite existing output if it exists
> - flag to tell it to compress output using gzip|bzip|lzo (currently this can be achieved by setting the directory name to end in .gz or .bz2, which is a bit awkward)
> - flag to tell it to store the schema and header (perhaps by merging in PigStorageSchema work?)

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira