You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Jeff Rodenburg <je...@gmail.com> on 2005/12/03 19:26:05 UTC

Distributed sort

In one of the Google Labs whitepapers (
http://labs.google.com/papers/mapreduce-osdi04.pdf), a programming construct
known as MapReduce is used in a variety of jobs/tasks within Google's
operation.  As an example of the application of MapReduce, the whitepaper
refers to Distributed Sorting.

Essentially this looks like a data storage strategy to facilitate a
distributed sort feature.  How does this approach compare to Lucene's
sorting mechanism?

Re: Distributed sort

Posted by Jeff Rodenburg <je...@gmail.com>.
thanks Erik

On 12/3/05, Erik Hatcher <er...@ehatchersolutions.com> wrote:
>
>
> On Dec 3, 2005, at 1:26 PM, Jeff Rodenburg wrote:
>
> > In one of the Google Labs whitepapers (
> > http://labs.google.com/papers/mapreduce-osdi04.pdf), a programming
> > construct
> > known as MapReduce is used in a variety of jobs/tasks within Google's
> > operation.  As an example of the application of MapReduce, the
> > whitepaper
> > refers to Distributed Sorting.
> >
> > Essentially this looks like a data storage strategy to facilitate a
> > distributed sort feature.  How does this approach compare to Lucene's
> > sorting mechanism?
>
> Look to Nutch to see how MapReduce is being implemented around Lucene
> (but useful in other ways than Lucene also).
>
>         Erik
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Re: Distributed sort

Posted by Erik Hatcher <er...@ehatchersolutions.com>.
On Dec 3, 2005, at 1:26 PM, Jeff Rodenburg wrote:

> In one of the Google Labs whitepapers (
> http://labs.google.com/papers/mapreduce-osdi04.pdf), a programming  
> construct
> known as MapReduce is used in a variety of jobs/tasks within Google's
> operation.  As an example of the application of MapReduce, the  
> whitepaper
> refers to Distributed Sorting.
>
> Essentially this looks like a data storage strategy to facilitate a
> distributed sort feature.  How does this approach compare to Lucene's
> sorting mechanism?

Look to Nutch to see how MapReduce is being implemented around Lucene  
(but useful in other ways than Lucene also).

	Erik


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org