Posted to hdfs-user@hadoop.apache.org by Flavio Dias <fl...@gmail.com> on 2012/08/08 14:43:26 UTC
text file to sequence file
My text file has one record per line, each containing numerical data. How do I
convert this text file to a sequence file?
--
Flávio Dias
FlavioDiasPS@gmail.com
Re: text file to sequence file
Posted by Harit Himanshu <ha...@gmail.com>.
cool, thanks Tariq
On Aug 8, 2012, at 8:52 AM, Mohammad Tariq wrote:
> Hello Harit,
>
> A SequenceFile is a flat file consisting of binary key/value pairs.
> Since the data is already in key/value format, it is highly
> efficient to run MapReduce jobs on these files. You can get complete
> info here - http://wiki.apache.org/hadoop/SequenceFile/
>
> Regards,
> Mohammad Tariq
>
>
> On Wed, Aug 8, 2012 at 9:20 PM, Harit Himanshu
> <ha...@gmail.com> wrote:
>> quick question, what is a sequence file?
>>
>> On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:
>>
>> If you have a large number of files and are using MapReduce to do the
>> conversion to Sequence Files, set the output format of the MR job to
>> SequenceFileOutputFormat.
>> Regards
>> Bejoy KS
>>
>> Sent from handheld, please excuse typos.
>> ________________________________
>> From: Flavio Dias <fl...@gmail.com>
>> Date: Wed, 8 Aug 2012 09:43:26 -0300
>> To: <us...@hadoop.apache.org>
>> ReplyTo: user@hadoop.apache.org
>> Subject: text file to sequence file
>>
>> My text file has one record per line, each containing numerical data. How do I
>> convert this text file to a sequence file?
>>
>> --
>> Flávio Dias
>> FlavioDiasPS@gmail.com
>>
>>
Re: text file to sequence file
Posted by Mohammad Tariq <do...@gmail.com>.
Hello Harit,
A SequenceFile is a flat file consisting of binary key/value pairs.
Since the data is already in key/value format, it is highly
efficient to run MapReduce jobs on these files. You can get complete
info here - http://wiki.apache.org/hadoop/SequenceFile/
Regards,
Mohammad Tariq
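
To make the "flat file of binary key/value pairs" idea concrete, here is a toy
sketch in plain Java. It is only an illustration of the concept and is NOT
Hadoop's actual on-disk SequenceFile format (a real SequenceFile also carries a
header, sync markers and optional compression). The class name
KeyValueFileSketch, its methods, and the choice of the line's byte offset as
the key are all assumptions made for this example.

```java
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.EOFException;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical toy class: a flat file of binary (key, value) records.
// This is a conceptual sketch, not the Hadoop SequenceFile format.
class KeyValueFileSketch {

    // Writes one binary record per text line.
    // Key: byte offset of the line in the original text (assumed choice).
    // Value: the line itself, stored length-prefixed by writeUTF.
    static void write(Path file, List<String> lines) throws IOException {
        try (DataOutputStream out = new DataOutputStream(Files.newOutputStream(file))) {
            long offset = 0;
            for (String line : lines) {
                out.writeLong(offset);
                out.writeUTF(line);
                // +1 accounts for the newline that ended the line in the text file.
                offset += line.getBytes(StandardCharsets.UTF_8).length + 1;
            }
        }
    }

    // Reads the records back and renders each as "key<TAB>value".
    static List<String> read(Path file) throws IOException {
        List<String> records = new ArrayList<>();
        try (DataInputStream in = new DataInputStream(Files.newInputStream(file))) {
            while (true) {
                long key;
                try {
                    key = in.readLong();
                } catch (EOFException endOfFile) {
                    break; // no more records
                }
                records.add(key + "\t" + in.readUTF());
            }
        }
        return records;
    }

    public static void main(String[] args) throws IOException {
        Path file = Files.createTempFile("records", ".bin");
        write(file, Arrays.asList("1 2 3", "4 5 6"));
        for (String record : read(file)) {
            System.out.println(record);
        }
        Files.delete(file);
    }
}
```

Because every record already arrives as a typed key and value, a reader does
not have to re-parse lines of text, which is the efficiency point made above.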
On Wed, Aug 8, 2012 at 9:20 PM, Harit Himanshu
<ha...@gmail.com> wrote:
> quick question, what is a sequence file?
>
> On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:
>
> If you have a large number of files and are using MapReduce to do the
> conversion to Sequence Files, set the output format of the MR job to
> SequenceFileOutputFormat.
> Regards
> Bejoy KS
>
> Sent from handheld, please excuse typos.
> ________________________________
> From: Flavio Dias <fl...@gmail.com>
> Date: Wed, 8 Aug 2012 09:43:26 -0300
> To: <us...@hadoop.apache.org>
> ReplyTo: user@hadoop.apache.org
> Subject: text file to sequence file
>
> My text file has one record per line, each containing numerical data. How do I
> convert this text file to a sequence file?
>
> --
> Flávio Dias
> FlavioDiasPS@gmail.com
>
>
Re: text file to sequence file
Posted by Harit Himanshu <ha...@gmail.com>.
quick question, what is a sequence file?
On Aug 8, 2012, at 8:13 AM, Bejoy KS wrote:
> If you have a large number of files and are using MapReduce to do the conversion to Sequence Files, set the output format of the MR job to SequenceFileOutputFormat.
> Regards
> Bejoy KS
>
> Sent from handheld, please excuse typos.
> From: Flavio Dias <fl...@gmail.com>
> Date: Wed, 8 Aug 2012 09:43:26 -0300
> To: <us...@hadoop.apache.org>
> ReplyTo: user@hadoop.apache.org
> Subject: text file to sequence file
>
> My text file has one record per line, each containing numerical data. How do I convert this text file to a sequence file?
>
> --
> Flávio Dias
> FlavioDiasPS@gmail.com
>
Re: text file to sequence file
Posted by Bejoy KS <be...@gmail.com>.
If you have a large number of files and are using MapReduce to do the conversion to Sequence Files, set the output format of the MR job to SequenceFileOutputFormat.
Regards
Bejoy KS
Sent from handheld, please excuse typos.
-----Original Message-----
From: Flavio Dias <fl...@gmail.com>
Date: Wed, 8 Aug 2012 09:43:26
To: <us...@hadoop.apache.org>
Reply-To: user@hadoop.apache.org
Subject: text file to sequence file
My text file has one record per line, each containing numerical data. How do I
convert this text file to a sequence file?
--
Flávio Dias
FlavioDiasPS@gmail.com
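
For the conversion asked about in this thread, a minimal sketch of the
map-only job that the advice above points to might look like the following.
It was not posted by anyone in the thread; the class names TextToSequenceFile
and LineMapper, the choice of the line's byte offset as the key, and the two
command-line arguments are assumptions for illustration. It also assumes the
org.apache.hadoop.mapreduce API with Job.getInstance, which is available in
Hadoop 2 and later; on the 1.x releases current in 2012 the job was typically
created with new Job(conf, name) instead.

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.SequenceFileOutputFormat;

// Hypothetical driver: reads plain text, writes SequenceFiles.
public class TextToSequenceFile {

    // Passes each (byte offset, line) pair through unchanged.
    // Parsing the numbers into a numeric Writable would go here instead,
    // if a typed value is wanted (not shown; depends on the record layout).
    public static class LineMapper
            extends Mapper<LongWritable, Text, LongWritable, Text> {
        @Override
        protected void map(LongWritable offset, Text line, Context context)
                throws IOException, InterruptedException {
            context.write(offset, line);
        }
    }

    public static void main(String[] args) throws Exception {
        // args[0]: input directory of text files, args[1]: output directory (assumed)
        Job job = Job.getInstance(new Configuration(), "text to sequence file");
        job.setJarByClass(TextToSequenceFile.class);

        job.setMapperClass(LineMapper.class);
        job.setNumReduceTasks(0); // map-only: no shuffle or reduce needed

        job.setInputFormatClass(TextInputFormat.class);
        // The key step from the reply above: emit SequenceFiles.
        job.setOutputFormatClass(SequenceFileOutputFormat.class);
        job.setOutputKeyClass(LongWritable.class);
        job.setOutputValueClass(Text.class);

        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

With zero reducers each map task writes its own output file, so the number of
SequenceFiles produced follows the number of input splits.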