Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2010/09/15 15:57:09 UTC

[Nutch Wiki] Trivial Update of "MapReduce" by AndreRicardo

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "MapReduce" page has been changed by AndreRicardo.
http://wiki.apache.org/nutch/MapReduce?action=diff&rev1=6&rev2=7

--------------------------------------------------

  
   * In essence, it allows massive data sets to be processed in a distributed fashion by breaking the processing into many small computations of two types:
    1. A Map operation that transforms the input into an intermediate representation.
-   2. A Reduce function that recombines the intermediate representation into the final output.
+   1. A Reduce function that recombines the intermediate representation into the final output.
  
-  * This processing model is ideal for the operations a search engine indexer like Nutch or Google needs to perform - like computing inlinks for URLs, or building inverted indexes - and it will [[http://wiki.apache.org/nutch-data/attachments/Presentations/attachments/mapred.pdf|"transform Nutch"]] into a scalable, distributed search engine.
+  * This processing model is ideal for the operations a search engine indexer like Nutch or Google needs to perform - like computing inlinks for URLs, or building inverted indexes - and it will [[attachment:Presentations/mapred.pdf|"transform Nutch"]] into a scalable, distributed search engine.
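
To make the two operation types above concrete, here is a minimal single-process sketch of the "computing inlinks for URLs" case. It is not Nutch or Hadoop code: the function names (map_outlinks, reduce_inlinks, run_job) and the sample URLs are hypothetical, and the in-memory grouping merely stands in for the shuffle that a real distributed framework would perform between the two phases.

```python
from collections import defaultdict


def map_outlinks(page_url, outlinks):
    """Map step (hypothetical name): emit one (target, source) pair per outgoing link."""
    for target in outlinks:
        yield target, page_url


def reduce_inlinks(target, sources):
    """Reduce step (hypothetical name): recombine all sources that point at one target."""
    return target, sorted(set(sources))


def run_job(pages):
    """Simulate the grouping between map and reduce in a single process."""
    grouped = defaultdict(list)
    for page_url, outlinks in pages.items():
        for key, value in map_outlinks(page_url, outlinks):
            grouped[key].append(value)
    return dict(reduce_inlinks(key, values) for key, values in grouped.items())


# Illustrative input: each page mapped to the URLs it links to.
pages = {
    "a.example": ["b.example", "c.example"],
    "b.example": ["c.example"],
}
inlinks = run_job(pages)
```

Each map call sees only one page and each reduce call sees only one target URL, which is what would let a framework spread the calls across many machines.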