You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Nigel Daley (JIRA)" <ji...@apache.org> on 2008/11/13 00:51:44 UTC
[jira] Updated: (HADOOP-1694) lzo compressed input files not
properly recognized
[ https://issues.apache.org/jira/browse/HADOOP-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nigel Daley updated HADOOP-1694:
--------------------------------
Fix Version/s: (was: 0.19.0)
> lzo compressed input files not properly recognized
> --------------------------------------------------
>
> Key: HADOOP-1694
> URL: https://issues.apache.org/jira/browse/HADOOP-1694
> Project: Hadoop Core
> Issue Type: Bug
> Components: mapred
> Affects Versions: 0.14.0
> Reporter: Nigel Daley
> Assignee: Arun C Murthy
> Attachments: part-201.txt.lzo
>
>
> When running the wordcount example with text, gzip and lzo compressed input files, the lzo compressed input files are not properly recognized and are treated as text files.
> With an input dir of
> {quote}
> /user/hadoopqa/input/part-001.txt
> /user/hadoopqa/input/part-002.txt.gz
> /user/hadoopqa/input/part-003.txt.lzo
> {quote}
> and running this command
> {quote}
> bin/hadoopqa jar hadoop-examples.jar wordcount /user/hadoopqa/input /user/hadoopqa/output
> {quote}
> I get output that looks like
> {quote}
> row 4
> royal 4
> rt$3-ex?ÔøΩ?÷µIStÔøΩ"4D%ÔøΩ9$UÔøΩÔøΩ"ÔøΩ, 1
> ru$ÔøΩÔøΩ#~t"@ÔøΩm*d#\/$ÔøΩÔøΩl.t"XÔøΩÔøΩDi" 1
> rubbÔøΩdÔøΩ&@bT 1
> rubbed 2
> {quote}
> To lzo compress the file I used lzop:
> http://www.lzop.org/download/lzop-1.01-linux_i386.tar.gz
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.