You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@apex.apache.org by Sandeep Deshmukh <sa...@datatorrent.com> on 2015/09/29 12:01:59 UTC

Re: How to use FileSplitter to read huge file

Moving to dev@apex.

Regards,
Sandeep

On Tue, Sep 29, 2015 at 3:29 PM, Sandeep Deshmukh <sa...@datatorrent.com>
wrote:

> If you need to consider record boundary, you will need to handle it
> similar to TextInputFormat of Hadoop
>
> Regards,
> Sandeep
>
> On Tue, Sep 29, 2015 at 1:45 PM, chiranjeevi vasupilli <
> chiru.vcj@gmail.com> wrote:
>
>> I have one on block size , How to decide the block size?
>>
>> as per my understanding the
>>
>> noofBlocks=filesize / blocksize
>>
>> By this some records may be split into two blocks, when converting into
>>  record we dont have the complete data in one block.
>>
>> how to handle this scenario?
>>
>> thanks in adavance.
>>
>> Thanks -Chiru
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "apex-dev" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to apex-dev+unsubscribe@googlegroups.com.
>> To post to this group, send email to apex-dev@googlegroups.com.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/apex-dev/43d87684-c53e-4b8d-85f6-6efe9b7d69b5%40googlegroups.com
>> <https://groups.google.com/d/msgid/apex-dev/43d87684-c53e-4b8d-85f6-6efe9b7d69b5%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>