You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-commits@hadoop.apache.org by Apache Wiki <wi...@apache.org> on 2009/07/09 05:56:48 UTC

[Hadoop Wiki] Update of "HadoopMapReduce" by DanielTunkelang

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The following page has been changed by DanielTunkelang:
http://wiki.apache.org/hadoop/HadoopMapReduce

The comment on the change is:
updated link to deprecated FileSplit class

------------------------------------------------------------------------------
  == Map ==
  
  As the Map operation is parallelized the input file set is first
- split to several pieces called [http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/mapred/FileSplit.html FileSplits]. If an individual file
+ split to several pieces called [http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/mapreduce/lib/input/FileSplit.html FileSplits]. If an individual file
  is so large that it will affect seek time it will be split to
  several Splits. The splitting does not know anything about the
  input file's internal logical structure, for example