You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by Vivek Padmanabhan <pv...@yahoo-inc.com> on 2011/07/11 08:14:47 UTC

Review Request: PIG-2130 Piggybank:MultiStorage is not compressing output files

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1063/
-----------------------------------------------------------

Review request for pig.


Summary
-------

MultiStorage is not compressing the records while writing the output. Even though it takes a compression param, when the record is written it ignores the compression.

As a fix enabled compressing the output.

If compression is used, then the sub diretories and output files will be having the corresponding extension.
For example, if output001.bz2 is output path and f1,f2 are the keys, the files will look like;

/tmp/output001.bz2
/tmp/output001.bz2/f1.bz2
/tmp/output001.bz2/f1.bz2/f1-0.bz2

/tmp/output001.bz2/f2.bz2
/tmp/output001.bz2/f2.bz2/f2-0.bz2


This addresses bug PIG-2130.
    https://issues.apache.org/jira/browse/PIG-2130


Diffs
-----


Diff: https://reviews.apache.org/r/1063/diff


Testing
-------


Thanks,

Vivek