You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@couchdb.apache.org by Christopher Bare <cb...@systemsbiology.org> on 2010/09/20 05:31:12 UTC

distributed map-reduce queries

Hi Couchers,

I'm investigating using CouchDB for a data mining application and
could use some advice.

What I have in mind is sharding a collection of documents between
several instances of CouchDB each running on their own nodes. Then, I
want to run distributed map-reduce queries over the whole collection
of documents. Do I understand correctly that Lounge is currently the
way to do this?

How would doing something like this with CouchDB and Lounge compare
with using Hadoop and HBase?

Thanks for any insight!

-Chris

Re: distributed map-reduce queries

Posted by David Coallier <da...@php.net>.
On 20 September 2010 04:31, Christopher Bare <cb...@systemsbiology.org> wrote:
> Hi Couchers,
>
> I'm investigating using CouchDB for a data mining application and
> could use some advice.
>
> What I have in mind is sharding a collection of documents between
> several instances of CouchDB each running on their own nodes. Then, I
> want to run distributed map-reduce queries over the whole collection
> of documents. Do I understand correctly that Lounge is currently the
> way to do this?
>
> How would doing something like this with CouchDB and Lounge compare
> with using Hadoop and HBase?
>

You should take a look at BigCouch from Cloudant it's fairly
impressive and very much along the lines of what you want to do I
believe: https://cloudant.com/#!/solutions/bigcouch or
http://github.com/cloudant/bigcouch

-- 
David Coallier