You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@crunch.apache.org by "Josh Wills (JIRA)" <ji...@apache.org> on 2013/06/29 02:03:20 UTC

[jira] [Issue Comment Deleted] (CRUNCH-228) FileTargetImpl cuts off extensions of output files

     [ https://issues.apache.org/jira/browse/CRUNCH-228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Josh Wills updated CRUNCH-228:
------------------------------

    Comment: was deleted

(was: The patch I came up with, which does the error reporting in MSCROutputHandler.)
    
> FileTargetImpl cuts off extensions of output files
> --------------------------------------------------
>
>                 Key: CRUNCH-228
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-228
>             Project: Crunch
>          Issue Type: Bug
>            Reporter: Dave Beech
>         Attachments: CRUNCH-228.patch, CRUNCH-228.patch
>
>
> Compressed files written by mapreduce often have extensions, e.g. '.deflate', '.gz' or '.snappy'. Crunch currently cuts off these extensions during the move of output files to their final destination, which is fine in some circumstances but causes problems in others. 
> For example, running 'hadoop fs -text myfile.deflate' will show the decompressed text on screen but running 'hadoop fs -text myfile' on a deflate-compressed file with no extension prints unreadable compressed data instead. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira