Posted to user@hbase.apache.org by Andrew Purtell <ap...@apache.org> on 2009/08/07 18:33:14 UTC

Re: HBase Analytics: A New DB for 80% of Facebook, YouTube-scale Sites

I agree. Well done!

You may also want to look at how Vertica, a commercial product, implements
parallel analytics on top of a column-based store:
  http://www.vertica.com/_pdf/VerticaArchitectureWhitePaper.pdf

For example, this is interesting:

  "Logical tables are decomposed and physically stored as overlapping
   groups of columns, called "projections," and each projection is sorted
   on a different attribute (or set of attributes), which optimizes them
   for answering queries with predicates on its sort attributes."

Call that insert-time view materialization? 
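
To make the quoted idea concrete, here is a rough Python sketch of two
overlapping projections over one logical table, each sorted on a different
attribute. It illustrates the concept only and is not Vertica's
implementation; the table, the column names, and the helpers
build_projection and lookup are all made up for the example.

```python
from bisect import bisect_left, bisect_right
from operator import itemgetter

# Hypothetical logical table; columns and values are invented for illustration.
ROWS = [
    {"user": "carol", "day": 3, "views": 7},
    {"user": "alice", "day": 1, "views": 5},
    {"user": "bob", "day": 2, "views": 9},
    {"user": "alice", "day": 3, "views": 2},
]


def build_projection(rows, columns, sort_key):
    """Store a subset of columns as tuples, sorted on sort_key."""
    idx = columns.index(sort_key)
    data = sorted((tuple(r[c] for c in columns) for r in rows),
                  key=itemgetter(idx))
    keys = [t[idx] for t in data]  # sort-attribute values, for binary search
    return {"columns": columns, "keys": keys, "data": data}


def lookup(proj, value):
    """Answer an equality predicate on the sort attribute by binary search."""
    lo = bisect_left(proj["keys"], value)
    hi = bisect_right(proj["keys"], value)
    return proj["data"][lo:hi]


# Two overlapping column groups, each sorted on a different attribute.
by_user = build_projection(ROWS, ["user", "views"], "user")
by_day = build_projection(ROWS, ["day", "user", "views"], "day")

alice_rows = lookup(by_user, "alice")
day3_rows = lookup(by_day, 3)
```

A predicate on "user" is served from by_user and a predicate on "day" from
by_day, so neither query scans the whole table; the cost is that every
insert has to be written into each projection.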

Clint Morgan's THBase contrib is maybe the first step down the road to
something like that done with an eager strategy. A lazy approach could do
something in the background whenever a WAL is rolled.
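
The difference between the two strategies can be sketched in a few lines of
Python. This is a toy model under loud assumptions: it uses neither the
THBase API nor HBase's real WAL, and the class names, the in-memory "wal"
list, and the roll_wal method are hypothetical stand-ins.

```python
class EagerIndexedTable:
    """Updates the secondary index inside every put (eager strategy)."""

    def __init__(self):
        self.rows = {}    # row key -> value
        self.index = {}   # value -> set of row keys

    def put(self, key, value):
        old = self.rows.get(key)
        if old is not None:
            self.index[old].discard(key)
        self.rows[key] = value
        self.index.setdefault(value, set()).add(key)


class LazyIndexedTable:
    """Logs edits and only folds them into the index when the log is rolled."""

    def __init__(self):
        self.rows = {}
        self.index = {}
        self.wal = []     # hypothetical stand-in for a write-ahead log

    def put(self, key, value):
        # Record the old and new value so the index can be fixed up later.
        self.wal.append((key, self.rows.get(key), value))
        self.rows[key] = value

    def roll_wal(self):
        # In a real system this would run in the background after a roll.
        rolled, self.wal = self.wal, []
        for key, old, new in rolled:
            if old is not None:
                self.index.get(old, set()).discard(key)
            self.index.setdefault(new, set()).add(key)


eager = EagerIndexedTable()
eager.put("r1", "x")

lazy = LazyIndexedTable()
lazy.put("r1", "x")
stale_index = dict(lazy.index)   # index has not seen the edit yet
lazy.roll_wal()
```

The eager table pays the index cost on every write but is always
consistent; the lazy table keeps writes cheap and accepts a stale index
until the next roll.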

   - Andy




________________________________
From: stack <st...@duboce.net>
To: hbase-user@hadoop.apache.org
Sent: Friday, August 7, 2009 8:55:01 AM
Subject: Re: HBase Analytics: A New DB for 80% of Facebook, YouTube-scale Sites

It's excellent.
St.Ack

On Fri, Aug 7, 2009 at 8:43 AM, Bradford Stephens <
bradfordstephens@gmail.com> wrote:

> Hey all,
>
> I'd really appreciate your feedback on an article I just wrote on a
> new kind of DB, built on HBase, to serve massive-scale websites and do
> low-latency analytics.
>
> http://www.roadtofailure.com/2009/08/07/a-new-db-for-80-of-facebook-youtube-scale-sites/
>
> Feel free to leave feedback, even if you vehemently disagree! Greatly
> appreciated.
>
> --
> http://www.roadtofailure.com -- The Fringes of Scalability, Social
> Media, and Computer Science
>