You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-commits@hadoop.apache.org by Apache Wiki <wi...@apache.org> on 2007/01/31 21:17:56 UTC

[Lucene-hadoop Wiki] Update of "Hbase" by JimKellerman

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by JimKellerman:
http://wiki.apache.org/lucene-hadoop/Hbase

New page:
##language:en
#pragma section-numbers off

= Hbase - Bigtable-like structured storage for Hadoop HDFS =

== Background ==

Google's [http://labs.google.com/papers/bigtable.html Bigtable],
a distributed storage system for structured data, is a very effective 
mechanism for storing very large amounts of data in a distributed
environment.
 
Just as Bigtable leverages the distributed data storage provided
by the [http://labs.google.com/papers/gfs.html Google File System],
Hbase will provide Bigtable-like capabilities on top of Hadoop.

Data is organized into tables, rows and columns, but a query language
like SQL is not supported. Instead, an Iterator-like interface is available
for scanning through a row range (and of course there is an ability to
retrieve a column value for a specific key).

Any particular column may have multiple values for the same row key.
A secondary key can be provided to select a particular value or an
Iterator can be set up to scan through the key-value pairs for that column 
given a specific row key.

== Rationale ==

Both Google's GFS and Hadoop's HDFS provide a mechanism to
reliably store large amounts of data. However, there is not really a 
mechanism for organizing the data and accessing only the parts that
are of interest to a particular application.

Bigtable (and Hbase) provide a means for organizing and efficiently
accessing these large data sets.

= Project Links =

Self:HbaseArchitecture - architecture - a work in progress.

== Initial Contributors ==

  * JimKellerman [[MailTo(jim AT SPAMFREE powerset DOT com)]]
  * Doug Judd [[MailTo(doug AT SPAMFREE zvents DOT com)]]
  * Ivan Small [[MailTo(ivan AT SPAMFREE blueseaturtle DOT com)]]