Posted to dev@pig.apache.org by "jiraposter@reviews.apache.org (JIRA)" <ji...@apache.org> on 2011/07/11 08:16:03 UTC

[jira] [Commented] (PIG-2130) Piggybank:MultiStorage is not compressing output files

    [ https://issues.apache.org/jira/browse/PIG-2130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062863#comment-13062863 ] 

jiraposter@reviews.apache.org commented on PIG-2130:
----------------------------------------------------


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1063/
-----------------------------------------------------------

Review request for pig.


Summary
-------

MultiStorage does not compress records when writing its output. Although it accepts a compression parameter, that parameter is ignored when each record is written.

This patch enables compression of the output.

When compression is used, the subdirectories and output files carry the corresponding extension.
For example, if output001.bz2 is the output path and f1 and f2 are the keys, the files will look like this:

/tmp/output001.bz2
/tmp/output001.bz2/f1.bz2
/tmp/output001.bz2/f1.bz2/f1-0.bz2

/tmp/output001.bz2/f2.bz2
/tmp/output001.bz2/f2.bz2/f2-0.bz2
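
The naming pattern in the listing above can be sketched in a few lines of Java. This is only an illustration of the layout, not the code in the patch: the class MultiStorageLayoutSketch and its helpers keyDir and partFile are hypothetical names, and the numeric suffix in "f1-0" is assumed to be a part or task index, which this message does not spell out.

```java
// Illustrative sketch only; not taken from PIG-2130_1.patch.
public class MultiStorageLayoutSketch {

    // Directory for one key: <outputPath>/<key><extension>
    static String keyDir(String outputPath, String key, String extension) {
        return outputPath + "/" + key + extension;
    }

    // File inside the key directory: <keyDir>/<key>-<partIndex><extension>
    // partIndex is an assumed name for the number after the dash.
    static String partFile(String outputPath, String key, int partIndex, String extension) {
        return keyDir(outputPath, key, extension) + "/" + key + "-" + partIndex + extension;
    }

    public static void main(String[] args) {
        String outputPath = "/tmp/output001.bz2";
        for (String key : new String[] {"f1", "f2"}) {
            System.out.println(keyDir(outputPath, key, ".bz2"));
            System.out.println(partFile(outputPath, key, 0, ".bz2"));
        }
    }
}
```

Running main prints the four key-level paths from the example; passing an empty extension would model the uncompressed case under the same assumptions.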


This addresses bug PIG-2130.
    https://issues.apache.org/jira/browse/PIG-2130


Diffs
-----


Diff: https://reviews.apache.org/r/1063/diff


Testing
-------


Thanks,

Vivek



> Piggybank:MultiStorage is not compressing output files
> ------------------------------------------------------
>
>                 Key: PIG-2130
>                 URL: https://issues.apache.org/jira/browse/PIG-2130
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.8.0, 0.9.0
>            Reporter: Vivek Padmanabhan
>            Assignee: Vivek Padmanabhan
>             Fix For: 0.9.0
>
>         Attachments: PIG-2130_1.patch
>
>
> MultiStorage is not compressing the records while writing the output. Even though it takes a compression param,  when the record is written it ignores the compression.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira