You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-user@hadoop.apache.org by Anit Alexander <an...@gmail.com> on 2012/09/03 13:00:43 UTC
custom format
hello user,
I am trying to create a map reduce program which will have splits
based on a specific length. The content has to be extracted in a way
such that the newline(\n) or tab(\t) etc characters will be considered
as a byte and not as a mapper instance. is this possible through
custom input? if yes, how will i create a custom file split based on a
specific length value. Any suggestions?
Regards,
Anit
Re: custom format
Posted by Anit Alexander <an...@gmail.com>.
Hi Hemanth,
Thank you for your valuable reply.
Regards,
Anit
On Mon, Sep 3, 2012 at 4:57 PM, Hemanth Yamijala <yh...@gmail.com> wrote:
> Hi,
>
> I found this while trying to see if such a FileFormat or Split already exists:
> http://bitsofinfo.wordpress.com/2009/11/01/reading-fixed-length-width-input-record-reader-with-hadoop-mapreduce/
>
> I have certainly not tried it myself, hence can't say if it is
> current, etc. But maybe it'll help you in some way.
>
> Thanks
> Hemanth
>
> On Mon, Sep 3, 2012 at 4:30 PM, Anit Alexander <an...@gmail.com> wrote:
>> hello user,
>>
>> I am trying to create a map reduce program which will have splits
>> based on a specific length. The content has to be extracted in a way
>> such that the newline(\n) or tab(\t) etc characters will be considered
>> as a byte and not as a mapper instance. is this possible through
>> custom input? if yes, how will i create a custom file split based on a
>> specific length value. Any suggestions?
>>
>> Regards,
>> Anit
Re: custom format
Posted by Anit Alexander <an...@gmail.com>.
Hi Hemanth,
Thank you for your valuable reply.
Regards,
Anit
On Mon, Sep 3, 2012 at 4:57 PM, Hemanth Yamijala <yh...@gmail.com> wrote:
> Hi,
>
> I found this while trying to see if such a FileFormat or Split already exists:
> http://bitsofinfo.wordpress.com/2009/11/01/reading-fixed-length-width-input-record-reader-with-hadoop-mapreduce/
>
> I have certainly not tried it myself, hence can't say if it is
> current, etc. But maybe it'll help you in some way.
>
> Thanks
> Hemanth
>
> On Mon, Sep 3, 2012 at 4:30 PM, Anit Alexander <an...@gmail.com> wrote:
>> hello user,
>>
>> I am trying to create a map reduce program which will have splits
>> based on a specific length. The content has to be extracted in a way
>> such that the newline(\n) or tab(\t) etc characters will be considered
>> as a byte and not as a mapper instance. is this possible through
>> custom input? if yes, how will i create a custom file split based on a
>> specific length value. Any suggestions?
>>
>> Regards,
>> Anit
Re: custom format
Posted by Anit Alexander <an...@gmail.com>.
Hi Hemanth,
Thank you for your valuable reply.
Regards,
Anit
On Mon, Sep 3, 2012 at 4:57 PM, Hemanth Yamijala <yh...@gmail.com> wrote:
> Hi,
>
> I found this while trying to see if such a FileFormat or Split already exists:
> http://bitsofinfo.wordpress.com/2009/11/01/reading-fixed-length-width-input-record-reader-with-hadoop-mapreduce/
>
> I have certainly not tried it myself, hence can't say if it is
> current, etc. But maybe it'll help you in some way.
>
> Thanks
> Hemanth
>
> On Mon, Sep 3, 2012 at 4:30 PM, Anit Alexander <an...@gmail.com> wrote:
>> hello user,
>>
>> I am trying to create a map reduce program which will have splits
>> based on a specific length. The content has to be extracted in a way
>> such that the newline(\n) or tab(\t) etc characters will be considered
>> as a byte and not as a mapper instance. is this possible through
>> custom input? if yes, how will i create a custom file split based on a
>> specific length value. Any suggestions?
>>
>> Regards,
>> Anit
Re: custom format
Posted by Anit Alexander <an...@gmail.com>.
Hi Hemanth,
Thank you for your valuable reply.
Regards,
Anit
On Mon, Sep 3, 2012 at 4:57 PM, Hemanth Yamijala <yh...@gmail.com> wrote:
> Hi,
>
> I found this while trying to see if such a FileFormat or Split already exists:
> http://bitsofinfo.wordpress.com/2009/11/01/reading-fixed-length-width-input-record-reader-with-hadoop-mapreduce/
>
> I have certainly not tried it myself, hence can't say if it is
> current, etc. But maybe it'll help you in some way.
>
> Thanks
> Hemanth
>
> On Mon, Sep 3, 2012 at 4:30 PM, Anit Alexander <an...@gmail.com> wrote:
>> hello user,
>>
>> I am trying to create a map reduce program which will have splits
>> based on a specific length. The content has to be extracted in a way
>> such that the newline(\n) or tab(\t) etc characters will be considered
>> as a byte and not as a mapper instance. is this possible through
>> custom input? if yes, how will i create a custom file split based on a
>> specific length value. Any suggestions?
>>
>> Regards,
>> Anit
Re: custom format
Posted by Hemanth Yamijala <yh...@gmail.com>.
Hi,
I found this while trying to see if such a FileFormat or Split already exists:
http://bitsofinfo.wordpress.com/2009/11/01/reading-fixed-length-width-input-record-reader-with-hadoop-mapreduce/
I have certainly not tried it myself, hence can't say if it is
current, etc. But maybe it'll help you in some way.
Thanks
Hemanth
On Mon, Sep 3, 2012 at 4:30 PM, Anit Alexander <an...@gmail.com> wrote:
> hello user,
>
> I am trying to create a map reduce program which will have splits
> based on a specific length. The content has to be extracted in a way
> such that the newline(\n) or tab(\t) etc characters will be considered
> as a byte and not as a mapper instance. is this possible through
> custom input? if yes, how will i create a custom file split based on a
> specific length value. Any suggestions?
>
> Regards,
> Anit
Re: custom format
Posted by Hemanth Yamijala <yh...@gmail.com>.
Hi,
I found this while trying to see if such a FileFormat or Split already exists:
http://bitsofinfo.wordpress.com/2009/11/01/reading-fixed-length-width-input-record-reader-with-hadoop-mapreduce/
I have certainly not tried it myself, hence can't say if it is
current, etc. But maybe it'll help you in some way.
Thanks
Hemanth
On Mon, Sep 3, 2012 at 4:30 PM, Anit Alexander <an...@gmail.com> wrote:
> hello user,
>
> I am trying to create a map reduce program which will have splits
> based on a specific length. The content has to be extracted in a way
> such that the newline(\n) or tab(\t) etc characters will be considered
> as a byte and not as a mapper instance. is this possible through
> custom input? if yes, how will i create a custom file split based on a
> specific length value. Any suggestions?
>
> Regards,
> Anit
Re: custom format
Posted by Hemanth Yamijala <yh...@gmail.com>.
Hi,
I found this while trying to see if such a FileFormat or Split already exists:
http://bitsofinfo.wordpress.com/2009/11/01/reading-fixed-length-width-input-record-reader-with-hadoop-mapreduce/
I have certainly not tried it myself, hence can't say if it is
current, etc. But maybe it'll help you in some way.
Thanks
Hemanth
On Mon, Sep 3, 2012 at 4:30 PM, Anit Alexander <an...@gmail.com> wrote:
> hello user,
>
> I am trying to create a map reduce program which will have splits
> based on a specific length. The content has to be extracted in a way
> such that the newline(\n) or tab(\t) etc characters will be considered
> as a byte and not as a mapper instance. is this possible through
> custom input? if yes, how will i create a custom file split based on a
> specific length value. Any suggestions?
>
> Regards,
> Anit
Re: custom format
Posted by Hemanth Yamijala <yh...@gmail.com>.
Hi,
I found this while trying to see if such a FileFormat or Split already exists:
http://bitsofinfo.wordpress.com/2009/11/01/reading-fixed-length-width-input-record-reader-with-hadoop-mapreduce/
I have certainly not tried it myself, hence can't say if it is
current, etc. But maybe it'll help you in some way.
Thanks
Hemanth
On Mon, Sep 3, 2012 at 4:30 PM, Anit Alexander <an...@gmail.com> wrote:
> hello user,
>
> I am trying to create a map reduce program which will have splits
> based on a specific length. The content has to be extracted in a way
> such that the newline(\n) or tab(\t) etc characters will be considered
> as a byte and not as a mapper instance. is this possible through
> custom input? if yes, how will i create a custom file split based on a
> specific length value. Any suggestions?
>
> Regards,
> Anit