Posted to user@hadoop.apache.org by John Lilley <jo...@redpoint.net> on 2013/06/17 16:22:04 UTC

Hadoop database ecosystem overview

I'd like to find a web site or some slides that clearly delineate the "databases" of the Hadoop ecosystem and what each is good at. If we look at HBase, Hive, and Cassandra (and others?), is it easy to differentiate them based on:

* Transactional vs batch

* Full database operations (insert, delete, update) vs query-only vs append-only

* Managed storage vs open/flat/file storage

Also, it would be good to see how these stack up against Greenplum and MongoDB.
Thanks
john


Re: Hadoop database ecosystem overview

Posted by Ted Yu <yu...@gmail.com>.
John:

The following was one recent case study:

http://nosql.mypopescu.com/post/34305991777/ycsb-benchmark-results-for-cassandra-hbase-mongodb
Cheers

On Mon, Jun 17, 2013 at 7:22 AM, John Lilley <jo...@redpoint.net> wrote:

> I’d like to find a web site or some slides that clearly delineate the
> “databases” of the Hadoop ecosystem and what they are each good at.  If we
> look at HBase, Hive, and Cassandra (and others?) is it easy to
> differentiate them based on:
>
> * Transactional vs batch
>
> * Full database operations (insert, delete, update) vs
>   query-only vs append-only
>
> * Managed storage vs open/flat/file storage
>
> Also it would be good to see how these stack up against GreenPlum and
> MongoDb
>
> Thanks
>
> john
