Posted to mapreduce-user@hadoop.apache.org by psdc1978 <ps...@gmail.com> on 2010/06/09 18:36:44 UTC

Split files, index files and input files

Hi,

I'm having difficulty understanding all of the concepts in Hadoop MR.

1 -
Input files in MR contain index files. What is the purpose of the index
files in Hadoop?

2 -
MR uses split files. Is a split file an input file?


Regards,

-- 
Pedro

Re: Split files, index files and input files

Posted by Ted Yu <yu...@gmail.com>.
For 1, see
http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/io/MapFile.html
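In short, that Javadoc describes a MapFile as a directory holding two files: a data file with all the keys and values in sorted order, and a smaller index file with only a fraction of the keys. The index lets a lookup jump close to a key instead of scanning the whole data file. The sketch below is only an in-memory illustration of that idea. The class SparseIndexSketch and its methods are made up for this example and are not Hadoop's implementation or API, and the strictly-increasing-key check is this sketch's own simplification.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: an in-memory stand-in for the data-file/index-file pair.
// All names here are hypothetical; this is not Hadoop's MapFile code.
class SparseIndexSketch {
    // Stand-in for the data file: every key and value, in sorted key order.
    private final List<String> keys = new ArrayList<>();
    private final List<String> values = new ArrayList<>();
    // Stand-in for the index file: every Nth key and its position in the data.
    private final List<String> indexKeys = new ArrayList<>();
    private final List<Integer> indexPositions = new ArrayList<>();
    private final int indexInterval;

    SparseIndexSketch(int indexInterval) {
        if (indexInterval < 1) {
            throw new IllegalArgumentException("indexInterval must be >= 1");
        }
        this.indexInterval = indexInterval;
    }

    // Appends an entry; this sketch requires strictly increasing keys.
    void append(String key, String value) {
        if (!keys.isEmpty() && keys.get(keys.size() - 1).compareTo(key) >= 0) {
            throw new IllegalArgumentException("keys must be appended in increasing order");
        }
        if (keys.size() % indexInterval == 0) {
            indexKeys.add(key);              // only a fraction of keys is indexed
            indexPositions.add(keys.size());
        }
        keys.add(key);
        values.add(value);
    }

    // Returns the value for key, or null if the key is absent.
    String get(String key) {
        // Binary-search the index for the last indexed key <= the target key.
        int lo = 0, hi = indexKeys.size() - 1, slot = -1;
        while (lo <= hi) {
            int mid = (lo + hi) >>> 1;
            if (indexKeys.get(mid).compareTo(key) <= 0) {
                slot = mid;
                lo = mid + 1;
            } else {
                hi = mid - 1;
            }
        }
        if (slot < 0) {
            return null; // target sorts before the first key, or there is no data
        }
        // Scan forward in the data, at most indexInterval entries.
        int start = indexPositions.get(slot);
        int end = Math.min(keys.size(), start + indexInterval);
        for (int i = start; i < end; i++) {
            int cmp = keys.get(i).compareTo(key);
            if (cmp == 0) return values.get(i);
            if (cmp > 0) break;
        }
        return null;
    }

    int indexSize() {
        return indexKeys.size();
    }
}
```

With an interval of 3 and seven entries, only three keys end up in the index, and each lookup touches at most three data entries after the index search.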
For 2, see
http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/mapred/InputSplit.html
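So a split is not a separate file: per that Javadoc, an InputSplit represents the data to be processed by an individual Mapper, and for file input it is typically a byte range (path, start offset, length) within an input file. A rough sketch of how one file might be carved into such ranges follows. SplitSketch, Range and computeSplits are hypothetical names for this example, and the real FileInputFormat logic also considers block size and other settings that are left out here.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: carving one input file into logical byte ranges.
// All names are hypothetical; this is a simplification, not Hadoop's code.
class SplitSketch {
    // Stand-in for a file-based split: a byte range within one input file.
    static final class Range {
        final String path;
        final long start;
        final long length;

        Range(String path, long start, long length) {
            this.path = path;
            this.start = start;
            this.length = length;
        }
    }

    // Returns consecutive ranges of at most splitSize bytes covering the file.
    static List<Range> computeSplits(String path, long fileLength, long splitSize) {
        if (splitSize <= 0) {
            throw new IllegalArgumentException("splitSize must be positive");
        }
        List<Range> splits = new ArrayList<>();
        for (long start = 0; start < fileLength; start += splitSize) {
            long length = Math.min(splitSize, fileLength - start);
            splits.add(new Range(path, start, length)); // one map task per range
        }
        return splits;
    }
}
```

For a 150-byte file and a 64-byte split size, this yields three ranges starting at offsets 0, 64 and 128, the last one 22 bytes long. The input file itself is never physically cut up.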
