You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-commits@hadoop.apache.org by Apache Wiki <wi...@apache.org> on 2007/08/10 11:26:58 UTC
[Lucene-hadoop Wiki] Trivial Update of "HadoopMapReduce" by Arun C Murthy
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.
The following page has been changed by Arun C Murthy:
http://wiki.apache.org/lucene-hadoop/HadoopMapReduce
------------------------------------------------------------------------------
passed to the configured [http://lucene.apache.org/hadoop/docs/api/org/apache/hadoop/mapred/Mapper.html Mapper]. The user supplied Mapper does
whatever it wants with the input pair and calls [http://lucene.apache.org/hadoop/docs/api/org/apache/hadoop/mapred/OutputCollector.html#collect(org.apache.hadoop.io.WritableComparable,%20org.apache.hadoop.io.Writable) OutputCollector.collect] with key-value pairs of its own choosing. The output it
generates must use one key class and one value class. This is because
- the Map output will be written into a [http://lucene.apache.org/hadoop/docs/api/org/apache/hadoop/io/SequenceFile.html SequenceFile]
+ the Map output will be written into a [http://wiki.apache.org/lucene-hadoop/SequenceFile SequenceFile]
which has per-file type information and all the records must
have the same type (use subclassing if you want to output
different data structures). The Map input and output key-value