You are viewing a plain text version of this content. The canonical link for it is here.
Posted to hdfs-user@hadoop.apache.org by Flavio Dias <fl...@gmail.com> on 2012/08/08 14:43:26 UTC

text file to sequence file

My text file works with one line per record with numerical data. How do I
convert this text file to sequence file?

-- 
Flávio Dias
FlavioDiasPS@gmail.com

Re: text file to sequence file

Posted by Harit Himanshu <ha...@gmail.com>.
cool, thanks Tariq

On Aug 8, 2012, at 8:52 AM, Mohammad Tariq wrote:

> Hello Harit,
> 
>   SequenceFile is a flat file consisting of binary key/value pairs.
> Since, our data is already is in key/value format it is highly
> efficient to run MapReduce jobs on these files. You can get complete
> info here - http://wiki.apache.org/hadoop/SequenceFile/
> 
> Regards,
>    Mohammad Tariq
> 
> 
> On Wed, Aug 8, 2012 at 9:20 PM, Harit Himanshu
> <ha...@gmail.com> wrote:
>> quick question, what is sequence file?
>> 
>> On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:
>> 
>> If you have large number of files and using MapReduce to do the conversion
>> to Sequence Files, set the output format of the MR job as
>> SequenceFileOutputFormat.
>> Regards
>> Bejoy KS
>> 
>> Sent from handheld, please excuse typos.
>> ________________________________
>> From: Flavio Dias <fl...@gmail.com>
>> Date: Wed, 8 Aug 2012 09:43:26 -0300
>> To: <us...@hadoop.apache.org>
>> ReplyTo: user@hadoop.apache.org
>> Subject: text file to sequence file
>> 
>> My text file works with one line per record with numerical data. How do I
>> convert this text file to sequence file?
>> 
>> --
>> Flávio Dias
>> FlavioDiasPS@gmail.com
>> 
>> 


Re: text file to sequence file

Posted by Harit Himanshu <ha...@gmail.com>.
cool, thanks Tariq

On Aug 8, 2012, at 8:52 AM, Mohammad Tariq wrote:

> Hello Harit,
> 
>   SequenceFile is a flat file consisting of binary key/value pairs.
> Since, our data is already is in key/value format it is highly
> efficient to run MapReduce jobs on these files. You can get complete
> info here - http://wiki.apache.org/hadoop/SequenceFile/
> 
> Regards,
>    Mohammad Tariq
> 
> 
> On Wed, Aug 8, 2012 at 9:20 PM, Harit Himanshu
> <ha...@gmail.com> wrote:
>> quick question, what is sequence file?
>> 
>> On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:
>> 
>> If you have large number of files and using MapReduce to do the conversion
>> to Sequence Files, set the output format of the MR job as
>> SequenceFileOutputFormat.
>> Regards
>> Bejoy KS
>> 
>> Sent from handheld, please excuse typos.
>> ________________________________
>> From: Flavio Dias <fl...@gmail.com>
>> Date: Wed, 8 Aug 2012 09:43:26 -0300
>> To: <us...@hadoop.apache.org>
>> ReplyTo: user@hadoop.apache.org
>> Subject: text file to sequence file
>> 
>> My text file works with one line per record with numerical data. How do I
>> convert this text file to sequence file?
>> 
>> --
>> Flávio Dias
>> FlavioDiasPS@gmail.com
>> 
>> 


Re: text file to sequence file

Posted by Harit Himanshu <ha...@gmail.com>.
cool, thanks Tariq

On Aug 8, 2012, at 8:52 AM, Mohammad Tariq wrote:

> Hello Harit,
> 
>   SequenceFile is a flat file consisting of binary key/value pairs.
> Since, our data is already is in key/value format it is highly
> efficient to run MapReduce jobs on these files. You can get complete
> info here - http://wiki.apache.org/hadoop/SequenceFile/
> 
> Regards,
>    Mohammad Tariq
> 
> 
> On Wed, Aug 8, 2012 at 9:20 PM, Harit Himanshu
> <ha...@gmail.com> wrote:
>> quick question, what is sequence file?
>> 
>> On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:
>> 
>> If you have large number of files and using MapReduce to do the conversion
>> to Sequence Files, set the output format of the MR job as
>> SequenceFileOutputFormat.
>> Regards
>> Bejoy KS
>> 
>> Sent from handheld, please excuse typos.
>> ________________________________
>> From: Flavio Dias <fl...@gmail.com>
>> Date: Wed, 8 Aug 2012 09:43:26 -0300
>> To: <us...@hadoop.apache.org>
>> ReplyTo: user@hadoop.apache.org
>> Subject: text file to sequence file
>> 
>> My text file works with one line per record with numerical data. How do I
>> convert this text file to sequence file?
>> 
>> --
>> Flávio Dias
>> FlavioDiasPS@gmail.com
>> 
>> 


Re: text file to sequence file

Posted by Harit Himanshu <ha...@gmail.com>.
cool, thanks Tariq

On Aug 8, 2012, at 8:52 AM, Mohammad Tariq wrote:

> Hello Harit,
> 
>   SequenceFile is a flat file consisting of binary key/value pairs.
> Since, our data is already is in key/value format it is highly
> efficient to run MapReduce jobs on these files. You can get complete
> info here - http://wiki.apache.org/hadoop/SequenceFile/
> 
> Regards,
>    Mohammad Tariq
> 
> 
> On Wed, Aug 8, 2012 at 9:20 PM, Harit Himanshu
> <ha...@gmail.com> wrote:
>> quick question, what is sequence file?
>> 
>> On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:
>> 
>> If you have large number of files and using MapReduce to do the conversion
>> to Sequence Files, set the output format of the MR job as
>> SequenceFileOutputFormat.
>> Regards
>> Bejoy KS
>> 
>> Sent from handheld, please excuse typos.
>> ________________________________
>> From: Flavio Dias <fl...@gmail.com>
>> Date: Wed, 8 Aug 2012 09:43:26 -0300
>> To: <us...@hadoop.apache.org>
>> ReplyTo: user@hadoop.apache.org
>> Subject: text file to sequence file
>> 
>> My text file works with one line per record with numerical data. How do I
>> convert this text file to sequence file?
>> 
>> --
>> Flávio Dias
>> FlavioDiasPS@gmail.com
>> 
>> 


Re: text file to sequence file

Posted by Mohammad Tariq <do...@gmail.com>.
Hello Harit,

   SequenceFile is a flat file consisting of binary key/value pairs.
Since, our data is already is in key/value format it is highly
efficient to run MapReduce jobs on these files. You can get complete
info here - http://wiki.apache.org/hadoop/SequenceFile/

Regards,
    Mohammad Tariq


On Wed, Aug 8, 2012 at 9:20 PM, Harit Himanshu
<ha...@gmail.com> wrote:
> quick question, what is sequence file?
>
> On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:
>
> If you have large number of files and using MapReduce to do the conversion
> to Sequence Files, set the output format of the MR job as
> SequenceFileOutputFormat.
> Regards
> Bejoy KS
>
> Sent from handheld, please excuse typos.
> ________________________________
> From: Flavio Dias <fl...@gmail.com>
> Date: Wed, 8 Aug 2012 09:43:26 -0300
> To: <us...@hadoop.apache.org>
> ReplyTo: user@hadoop.apache.org
> Subject: text file to sequence file
>
> My text file works with one line per record with numerical data. How do I
> convert this text file to sequence file?
>
> --
> Flávio Dias
> FlavioDiasPS@gmail.com
>
>

Re: text file to sequence file

Posted by Mohammad Tariq <do...@gmail.com>.
Hello Harit,

   SequenceFile is a flat file consisting of binary key/value pairs.
Since, our data is already is in key/value format it is highly
efficient to run MapReduce jobs on these files. You can get complete
info here - http://wiki.apache.org/hadoop/SequenceFile/

Regards,
    Mohammad Tariq


On Wed, Aug 8, 2012 at 9:20 PM, Harit Himanshu
<ha...@gmail.com> wrote:
> quick question, what is sequence file?
>
> On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:
>
> If you have large number of files and using MapReduce to do the conversion
> to Sequence Files, set the output format of the MR job as
> SequenceFileOutputFormat.
> Regards
> Bejoy KS
>
> Sent from handheld, please excuse typos.
> ________________________________
> From: Flavio Dias <fl...@gmail.com>
> Date: Wed, 8 Aug 2012 09:43:26 -0300
> To: <us...@hadoop.apache.org>
> ReplyTo: user@hadoop.apache.org
> Subject: text file to sequence file
>
> My text file works with one line per record with numerical data. How do I
> convert this text file to sequence file?
>
> --
> Flávio Dias
> FlavioDiasPS@gmail.com
>
>

Re: text file to sequence file

Posted by Mohammad Tariq <do...@gmail.com>.
Hello Harit,

   SequenceFile is a flat file consisting of binary key/value pairs.
Since, our data is already is in key/value format it is highly
efficient to run MapReduce jobs on these files. You can get complete
info here - http://wiki.apache.org/hadoop/SequenceFile/

Regards,
    Mohammad Tariq


On Wed, Aug 8, 2012 at 9:20 PM, Harit Himanshu
<ha...@gmail.com> wrote:
> quick question, what is sequence file?
>
> On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:
>
> If you have large number of files and using MapReduce to do the conversion
> to Sequence Files, set the output format of the MR job as
> SequenceFileOutputFormat.
> Regards
> Bejoy KS
>
> Sent from handheld, please excuse typos.
> ________________________________
> From: Flavio Dias <fl...@gmail.com>
> Date: Wed, 8 Aug 2012 09:43:26 -0300
> To: <us...@hadoop.apache.org>
> ReplyTo: user@hadoop.apache.org
> Subject: text file to sequence file
>
> My text file works with one line per record with numerical data. How do I
> convert this text file to sequence file?
>
> --
> Flávio Dias
> FlavioDiasPS@gmail.com
>
>

Re: text file to sequence file

Posted by Mohammad Tariq <do...@gmail.com>.
Hello Harit,

   SequenceFile is a flat file consisting of binary key/value pairs.
Since, our data is already is in key/value format it is highly
efficient to run MapReduce jobs on these files. You can get complete
info here - http://wiki.apache.org/hadoop/SequenceFile/

Regards,
    Mohammad Tariq


On Wed, Aug 8, 2012 at 9:20 PM, Harit Himanshu
<ha...@gmail.com> wrote:
> quick question, what is sequence file?
>
> On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:
>
> If you have large number of files and using MapReduce to do the conversion
> to Sequence Files, set the output format of the MR job as
> SequenceFileOutputFormat.
> Regards
> Bejoy KS
>
> Sent from handheld, please excuse typos.
> ________________________________
> From: Flavio Dias <fl...@gmail.com>
> Date: Wed, 8 Aug 2012 09:43:26 -0300
> To: <us...@hadoop.apache.org>
> ReplyTo: user@hadoop.apache.org
> Subject: text file to sequence file
>
> My text file works with one line per record with numerical data. How do I
> convert this text file to sequence file?
>
> --
> Flávio Dias
> FlavioDiasPS@gmail.com
>
>

Re: text file to sequence file

Posted by Harit Himanshu <ha...@gmail.com>.
quick question, what is sequence file?

On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:

> If you have large number of files and using MapReduce to do the conversion to Sequence Files, set the output format of the MR job as SequenceFileOutputFormat.
> Regards
> Bejoy KS
> 
> Sent from handheld, please excuse typos.
> From: Flavio Dias <fl...@gmail.com>
> Date: Wed, 8 Aug 2012 09:43:26 -0300
> To: <us...@hadoop.apache.org>
> ReplyTo: user@hadoop.apache.org
> Subject: text file to sequence file
> 
> My text file works with one line per record with numerical data. How do I convert this text file to sequence file?
> 
> -- 
> Flávio Dias
> FlavioDiasPS@gmail.com
> 


Re: text file to sequence file

Posted by Harit Himanshu <ha...@gmail.com>.
quick question, what is sequence file?

On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:

> If you have large number of files and using MapReduce to do the conversion to Sequence Files, set the output format of the MR job as SequenceFileOutputFormat.
> Regards
> Bejoy KS
> 
> Sent from handheld, please excuse typos.
> From: Flavio Dias <fl...@gmail.com>
> Date: Wed, 8 Aug 2012 09:43:26 -0300
> To: <us...@hadoop.apache.org>
> ReplyTo: user@hadoop.apache.org
> Subject: text file to sequence file
> 
> My text file works with one line per record with numerical data. How do I convert this text file to sequence file?
> 
> -- 
> Flávio Dias
> FlavioDiasPS@gmail.com
> 


Re: text file to sequence file

Posted by Harit Himanshu <ha...@gmail.com>.
quick question, what is sequence file?

On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:

> If you have large number of files and using MapReduce to do the conversion to Sequence Files, set the output format of the MR job as SequenceFileOutputFormat.
> Regards
> Bejoy KS
> 
> Sent from handheld, please excuse typos.
> From: Flavio Dias <fl...@gmail.com>
> Date: Wed, 8 Aug 2012 09:43:26 -0300
> To: <us...@hadoop.apache.org>
> ReplyTo: user@hadoop.apache.org
> Subject: text file to sequence file
> 
> My text file works with one line per record with numerical data. How do I convert this text file to sequence file?
> 
> -- 
> Flávio Dias
> FlavioDiasPS@gmail.com
> 


Re: text file to sequence file

Posted by Harit Himanshu <ha...@gmail.com>.
quick question, what is sequence file?

On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:

> If you have large number of files and using MapReduce to do the conversion to Sequence Files, set the output format of the MR job as SequenceFileOutputFormat.
> Regards
> Bejoy KS
> 
> Sent from handheld, please excuse typos.
> From: Flavio Dias <fl...@gmail.com>
> Date: Wed, 8 Aug 2012 09:43:26 -0300
> To: <us...@hadoop.apache.org>
> ReplyTo: user@hadoop.apache.org
> Subject: text file to sequence file
> 
> My text file works with one line per record with numerical data. How do I convert this text file to sequence file?
> 
> -- 
> Flávio Dias
> FlavioDiasPS@gmail.com
> 


Re: text file to sequence file

Posted by Bejoy KS <be...@gmail.com>.
If you have large number of files and using MapReduce to do the conversion to Sequence Files, set the output format of the MR job as SequenceFileOutputFormat.
 
Regards
Bejoy KS

Sent from handheld, please excuse typos.

-----Original Message-----
From: Flavio Dias <fl...@gmail.com>
Date: Wed, 8 Aug 2012 09:43:26 
To: <us...@hadoop.apache.org>
Reply-To: user@hadoop.apache.org
Subject: text file to sequence file

My text file works with one line per record with numerical data. How do I
convert this text file to sequence file?

-- 
Flávio Dias
FlavioDiasPS@gmail.com


Re: text file to sequence file

Posted by Bejoy KS <be...@gmail.com>.
If you have large number of files and using MapReduce to do the conversion to Sequence Files, set the output format of the MR job as SequenceFileOutputFormat.
 
Regards
Bejoy KS

Sent from handheld, please excuse typos.

-----Original Message-----
From: Flavio Dias <fl...@gmail.com>
Date: Wed, 8 Aug 2012 09:43:26 
To: <us...@hadoop.apache.org>
Reply-To: user@hadoop.apache.org
Subject: text file to sequence file

My text file works with one line per record with numerical data. How do I
convert this text file to sequence file?

-- 
Flávio Dias
FlavioDiasPS@gmail.com


Re: text file to sequence file

Posted by Bejoy KS <be...@gmail.com>.
If you have large number of files and using MapReduce to do the conversion to Sequence Files, set the output format of the MR job as SequenceFileOutputFormat.
 
Regards
Bejoy KS

Sent from handheld, please excuse typos.

-----Original Message-----
From: Flavio Dias <fl...@gmail.com>
Date: Wed, 8 Aug 2012 09:43:26 
To: <us...@hadoop.apache.org>
Reply-To: user@hadoop.apache.org
Subject: text file to sequence file

My text file works with one line per record with numerical data. How do I
convert this text file to sequence file?

-- 
Flávio Dias
FlavioDiasPS@gmail.com


Re: text file to sequence file

Posted by Bejoy KS <be...@gmail.com>.
If you have large number of files and using MapReduce to do the conversion to Sequence Files, set the output format of the MR job as SequenceFileOutputFormat.
 
Regards
Bejoy KS

Sent from handheld, please excuse typos.

-----Original Message-----
From: Flavio Dias <fl...@gmail.com>
Date: Wed, 8 Aug 2012 09:43:26 
To: <us...@hadoop.apache.org>
Reply-To: user@hadoop.apache.org
Subject: text file to sequence file

My text file works with one line per record with numerical data. How do I
convert this text file to sequence file?

-- 
Flávio Dias
FlavioDiasPS@gmail.com