You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2010/09/15 15:57:09 UTC
[Nutch Wiki] Trivial Update of "MapReduce" by AndreRicardo
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "MapReduce" page has been changed by AndreRicardo.
http://wiki.apache.org/nutch/MapReduce?action=diff&rev1=6&rev2=7
--------------------------------------------------
* In essence, it allows massive data sets to be processed in a distributed fashion by breaking the processing into many small computations of two types:
1. A Map operation that transforms the input into an intermediate representation.
- 2. A Reduce function that recombines the intermediate representation into the final output.
+ 1. A Reduce function that recombines the intermediate representation into the final output.
- * This processing model is ideal for the operations a search engine indexer like Nutch or Google needs to perform - like computing inlinks for URLs, or building inverted indexes - and it will [[http://wiki.apache.org/nutch-data/attachments/Presentations/attachments/mapred.pdf|"transform Nutch"]] into a scalable, distributed search engine.
+ * This processing model is ideal for the operations a search engine indexer like Nutch or Google needs to perform - like computing inlinks for URLs, or building inverted indexes - and it will [[attachment:Presentations/mapred.pdf|"transform Nutch"]] into a scalable, distributed search engine.