You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-user@hadoop.apache.org by howardh <ho...@copacast.com> on 2008/11/03 04:49:35 UTC

Did Hadoop support gz/zip format file?

Hello,

I have a requirement of use gz/zip format in Hadoop,   After these days research and learning,   It seems Hadoop didn't support gz/zip file yet,    is that true?
Like I'm going to create file with gz format,  and read it later by FileSystem interface.     Does it feasible?     Experts!  Could you show me some advices?

Best Regards
2008-11-03 



howardh 

RE: Did Hadoop support gz/zip format file?

Posted by zhuweimin <xi...@tsm.kddilabs.jp>.
Hello

I have a requirement of use file with the extension .Z(the file is created
by UNIX compress command) in Hadoop.
It will too be automatically recognized / handled ?

Any suggestion on how to handle the .Z file in the map task?

Thanks
Best Regards

Alamo

-----Original Message-----
From: Abdul Qadeer [mailto:qadeer.qadeer@gmail.com] 
Sent: Monday, November 03, 2008 4:19 PM
To: core-user@hadoop.apache.org
Subject: Re: Did Hadoop support gz/zip format file?

Hadoop supports gzip format by means of its gzip codecs.
If you meant to use default input format / record reader of
Hadoop, it will automatically recognize / handle your gzip
input files.  And if you are using some other input formats / record
readers, you can use the gzip codec provided by Hadoop to
de-compress the data and then interpret it in your record reader.

On Sun, Nov 2, 2008 at 7:49 PM, howardh <ho...@copacast.com> wrote:

> Hello,
>
> I have a requirement of use gz/zip format in Hadoop,   After these days
> research and learning,   It seems Hadoop didn't support gz/zip file yet,
>  is that true?
> Like I'm going to create file with gz format,  and read it later by
> FileSystem interface.     Does it feasible?     Experts!  Could you show
me
> some advices?
>
> Best Regards
> 2008-11-03
>
>
>
> howardh
>



Re: Did Hadoop support gz/zip format file?

Posted by Abdul Qadeer <qa...@gmail.com>.
Hadoop supports gzip format by means of its gzip codecs.
If you meant to use default input format / record reader of
Hadoop, it will automatically recognize / handle your gzip
input files.  And if you are using some other input formats / record
readers, you can use the gzip codec provided by Hadoop to
de-compress the data and then interpret it in your record reader.

On Sun, Nov 2, 2008 at 7:49 PM, howardh <ho...@copacast.com> wrote:

> Hello,
>
> I have a requirement of use gz/zip format in Hadoop,   After these days
> research and learning,   It seems Hadoop didn't support gz/zip file yet,
>  is that true?
> Like I'm going to create file with gz format,  and read it later by
> FileSystem interface.     Does it feasible?     Experts!  Could you show me
> some advices?
>
> Best Regards
> 2008-11-03
>
>
>
> howardh
>