You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hbase.apache.org by Fabio Kaminski <fa...@mobly.com.br> on 2009/08/11 19:29:02 UTC

anyone with experience in hbase 0.20 realtime application?

is there anyone with experience in hbase 0.20 realtime application,
preferably in production environment?

in thinking in throw away all my legacy knowledge about what i think about
systems.. cause i think this is the hadoop(and hbase) are the next big thing
in tecnology.. i really buy this concept and im glady that i found it right
in it's inception.

Im preparing to work with hadoop and hbase for realtime environment, and i
could see that the hbase engineers are preparing hbase for realtime
applications, like rdbms standards does, but in a new and promissing
environment. this is undoubtly a paradigm shift!

anyone with realtime application runing in such environment? could you share
some of you experience with it?

Thanks !

Fabio Kaminski

Re: anyone with experience in hbase 0.20 realtime application?

Posted by Jonathan Gray <jl...@streamy.com>.
There are a few of us running HBase in a realtime application that is in 
production.  Streamy.com and StumbleUpon's su.pr application are two 
consumer-facing web applications with only HBase backing it.

We have lots of experience to share, and do so on this list, in blog 
posts, and through presentations/meetups as best we can.  Do you have 
specific questions?  Are your concerns about administration/operations, 
getting developers used to it, general lessons learned...?

Ryan Rawson from SU has a small but useful presentation he just gave at 
the HUG last weekend.  Hopefully we can get that put up somewhere soon.

JG

Fabio Kaminski wrote:
> is there anyone with experience in hbase 0.20 realtime application,
> preferably in production environment?
> 
> in thinking in throw away all my legacy knowledge about what i think about
> systems.. cause i think this is the hadoop(and hbase) are the next big thing
> in tecnology.. i really buy this concept and im glady that i found it right
> in it's inception.
> 
> Im preparing to work with hadoop and hbase for realtime environment, and i
> could see that the hbase engineers are preparing hbase for realtime
> applications, like rdbms standards does, but in a new and promissing
> environment. this is undoubtly a paradigm shift!
> 
> anyone with realtime application runing in such environment? could you share
> some of you experience with it?
> 
> Thanks !
> 
> Fabio Kaminski
> 

Re: anyone with experience in hbase 0.20 realtime application?

Posted by Vaibhav Puranik <vp...@gmail.com>.
We need some records from database every time our application gets a
request.
The data does not change frequently. That is why we are reading it from
HBase on the application startup and caching it in memory so that we don't
have to go to HBase every single time. These are relatively smaller tables
and can be held in memory very easily.

Important point to note here is that we would have done that even if we were
using RDBMS. As there is  no point in going to DB/HBase if you have
relatively stable data that needs to be read from smaller tables every
single time.

We are on 0.20 but not the release candidate version. We built the trunk on
23rd of July ourselves.

If you have relatively larger data that needs to be read frequently, I would
recommend using memcached.

Regards,
Vaibhav

On Wed, Aug 12, 2009 at 7:01 AM, bharath vissapragada <
bharathvissapragada1990@gmail.com> wrote:

> Vaibhav
>
> By caching do you mean storing storing all the rows in a HashMap in the
> memory , so that u can access that map repeatedly instead of Disk IOs ?
>
> Thanks
>
> On Wed, Aug 12, 2009 at 4:53 AM, Vaibhav Puranik <vp...@gmail.com>
> wrote:
>
> > Amandeep,
> >
> > We are caching Hbase results in memory (in a HashMap).
> >
> > Regards,
> > Vaibhav
> >
> > On Tue, Aug 11, 2009 at 12:56 PM, Amandeep Khurana <am...@gmail.com>
> > wrote:
> >
> > > Vaibhav,
> > >
> > > What kind of caching are you doing over hbase and how?
> > >
> > > -Amandeep
> > >
> > >
> > > Amandeep Khurana
> > > Computer Science Graduate Student
> > > University of California, Santa Cruz
> > >
> > >
> > > On Tue, Aug 11, 2009 at 10:48 AM, Vaibhav Puranik <vpuranik@gmail.com
> > > >wrote:
> > >
> > > > We are using HBase 0.20 (Trunk version at 23rd July evening) in
> > > production
> > > > environment at GumGum.
> > > >
> > > > Our experience is very good. Initially I mistakenly forgot to add
> > caching
> > > > (even though we had planned for it) and every request was fetching
> two
> > > rows
> > > > from Hbase and inserting one row in HBase.
> > > > In spite of that our request processing time was less than 300 ms.
> > > >
> > > > We are not getting huge amounts of requests - we approximately get
> > 25,000
> > > > to
> > > > 30,000 requests to our web app backed by HBase every day.
> > > >
> > > > We have a 4 node cluster running on EC2 (Large instances) and so far
> we
> > > > haven't faced any production problem.
> > > > (Hope it works out that way all the time!)
> > > >
> > > > Regards,
> > > > Vaibhav Puranik,
> > > > GumGum
> > > >
> > > >
> > > >
> > > >
> > > > On Tue, Aug 11, 2009 at 10:29 AM, Fabio Kaminski <
> > > > fabiokaminski@mobly.com.br
> > > > > wrote:
> > > >
> > > > > is there anyone with experience in hbase 0.20 realtime application,
> > > > > preferably in production environment?
> > > > >
> > > > > in thinking in throw away all my legacy knowledge about what i
> think
> > > > about
> > > > > systems.. cause i think this is the hadoop(and hbase) are the next
> > big
> > > > > thing
> > > > > in tecnology.. i really buy this concept and im glady that i found
> it
> > > > right
> > > > > in it's inception.
> > > > >
> > > > > Im preparing to work with hadoop and hbase for realtime
> environment,
> > > and
> > > > i
> > > > > could see that the hbase engineers are preparing hbase for realtime
> > > > > applications, like rdbms standards does, but in a new and
> promissing
> > > > > environment. this is undoubtly a paradigm shift!
> > > > >
> > > > > anyone with realtime application runing in such environment? could
> > you
> > > > > share
> > > > > some of you experience with it?
> > > > >
> > > > > Thanks !
> > > > >
> > > > > Fabio Kaminski
> > > > >
> > > >
> > >
> >
>

Re: anyone with experience in hbase 0.20 realtime application?

Posted by bharath vissapragada <bh...@gmail.com>.
Vaibhav

By caching do you mean storing storing all the rows in a HashMap in the
memory , so that u can access that map repeatedly instead of Disk IOs ?

Thanks

On Wed, Aug 12, 2009 at 4:53 AM, Vaibhav Puranik <vp...@gmail.com> wrote:

> Amandeep,
>
> We are caching Hbase results in memory (in a HashMap).
>
> Regards,
> Vaibhav
>
> On Tue, Aug 11, 2009 at 12:56 PM, Amandeep Khurana <am...@gmail.com>
> wrote:
>
> > Vaibhav,
> >
> > What kind of caching are you doing over hbase and how?
> >
> > -Amandeep
> >
> >
> > Amandeep Khurana
> > Computer Science Graduate Student
> > University of California, Santa Cruz
> >
> >
> > On Tue, Aug 11, 2009 at 10:48 AM, Vaibhav Puranik <vpuranik@gmail.com
> > >wrote:
> >
> > > We are using HBase 0.20 (Trunk version at 23rd July evening) in
> > production
> > > environment at GumGum.
> > >
> > > Our experience is very good. Initially I mistakenly forgot to add
> caching
> > > (even though we had planned for it) and every request was fetching two
> > rows
> > > from Hbase and inserting one row in HBase.
> > > In spite of that our request processing time was less than 300 ms.
> > >
> > > We are not getting huge amounts of requests - we approximately get
> 25,000
> > > to
> > > 30,000 requests to our web app backed by HBase every day.
> > >
> > > We have a 4 node cluster running on EC2 (Large instances) and so far we
> > > haven't faced any production problem.
> > > (Hope it works out that way all the time!)
> > >
> > > Regards,
> > > Vaibhav Puranik,
> > > GumGum
> > >
> > >
> > >
> > >
> > > On Tue, Aug 11, 2009 at 10:29 AM, Fabio Kaminski <
> > > fabiokaminski@mobly.com.br
> > > > wrote:
> > >
> > > > is there anyone with experience in hbase 0.20 realtime application,
> > > > preferably in production environment?
> > > >
> > > > in thinking in throw away all my legacy knowledge about what i think
> > > about
> > > > systems.. cause i think this is the hadoop(and hbase) are the next
> big
> > > > thing
> > > > in tecnology.. i really buy this concept and im glady that i found it
> > > right
> > > > in it's inception.
> > > >
> > > > Im preparing to work with hadoop and hbase for realtime environment,
> > and
> > > i
> > > > could see that the hbase engineers are preparing hbase for realtime
> > > > applications, like rdbms standards does, but in a new and promissing
> > > > environment. this is undoubtly a paradigm shift!
> > > >
> > > > anyone with realtime application runing in such environment? could
> you
> > > > share
> > > > some of you experience with it?
> > > >
> > > > Thanks !
> > > >
> > > > Fabio Kaminski
> > > >
> > >
> >
>

Re: anyone with experience in hbase 0.20 realtime application?

Posted by Amandeep Khurana <am...@gmail.com>.
Vaibhav

Can you explain a little more? I'm also building a system where the
front end will feed off hbase but haven't implemented any caching yet.
Maybe you could give me some tips and share your experience. Also,
what version of hbase are you on? .20 has blockcache enabled by
default and that makes reads much faster.

-amandeep

On 8/11/09, Vaibhav Puranik <vp...@gmail.com> wrote:
> Amandeep,
>
> We are caching Hbase results in memory (in a HashMap).
>
> Regards,
> Vaibhav
>
> On Tue, Aug 11, 2009 at 12:56 PM, Amandeep Khurana <am...@gmail.com> wrote:
>
>> Vaibhav,
>>
>> What kind of caching are you doing over hbase and how?
>>
>> -Amandeep
>>
>>
>> Amandeep Khurana
>> Computer Science Graduate Student
>> University of California, Santa Cruz
>>
>>
>> On Tue, Aug 11, 2009 at 10:48 AM, Vaibhav Puranik <vpuranik@gmail.com
>> >wrote:
>>
>> > We are using HBase 0.20 (Trunk version at 23rd July evening) in
>> production
>> > environment at GumGum.
>> >
>> > Our experience is very good. Initially I mistakenly forgot to add
>> > caching
>> > (even though we had planned for it) and every request was fetching two
>> rows
>> > from Hbase and inserting one row in HBase.
>> > In spite of that our request processing time was less than 300 ms.
>> >
>> > We are not getting huge amounts of requests - we approximately get
>> > 25,000
>> > to
>> > 30,000 requests to our web app backed by HBase every day.
>> >
>> > We have a 4 node cluster running on EC2 (Large instances) and so far we
>> > haven't faced any production problem.
>> > (Hope it works out that way all the time!)
>> >
>> > Regards,
>> > Vaibhav Puranik,
>> > GumGum
>> >
>> >
>> >
>> >
>> > On Tue, Aug 11, 2009 at 10:29 AM, Fabio Kaminski <
>> > fabiokaminski@mobly.com.br
>> > > wrote:
>> >
>> > > is there anyone with experience in hbase 0.20 realtime application,
>> > > preferably in production environment?
>> > >
>> > > in thinking in throw away all my legacy knowledge about what i think
>> > about
>> > > systems.. cause i think this is the hadoop(and hbase) are the next big
>> > > thing
>> > > in tecnology.. i really buy this concept and im glady that i found it
>> > right
>> > > in it's inception.
>> > >
>> > > Im preparing to work with hadoop and hbase for realtime environment,
>> and
>> > i
>> > > could see that the hbase engineers are preparing hbase for realtime
>> > > applications, like rdbms standards does, but in a new and promissing
>> > > environment. this is undoubtly a paradigm shift!
>> > >
>> > > anyone with realtime application runing in such environment? could you
>> > > share
>> > > some of you experience with it?
>> > >
>> > > Thanks !
>> > >
>> > > Fabio Kaminski
>> > >
>> >
>>
>


-- 


Amandeep Khurana
Computer Science Graduate Student
University of California, Santa Cruz

Re: anyone with experience in hbase 0.20 realtime application?

Posted by Vaibhav Puranik <vp...@gmail.com>.
Amandeep,

We are caching Hbase results in memory (in a HashMap).

Regards,
Vaibhav

On Tue, Aug 11, 2009 at 12:56 PM, Amandeep Khurana <am...@gmail.com> wrote:

> Vaibhav,
>
> What kind of caching are you doing over hbase and how?
>
> -Amandeep
>
>
> Amandeep Khurana
> Computer Science Graduate Student
> University of California, Santa Cruz
>
>
> On Tue, Aug 11, 2009 at 10:48 AM, Vaibhav Puranik <vpuranik@gmail.com
> >wrote:
>
> > We are using HBase 0.20 (Trunk version at 23rd July evening) in
> production
> > environment at GumGum.
> >
> > Our experience is very good. Initially I mistakenly forgot to add caching
> > (even though we had planned for it) and every request was fetching two
> rows
> > from Hbase and inserting one row in HBase.
> > In spite of that our request processing time was less than 300 ms.
> >
> > We are not getting huge amounts of requests - we approximately get 25,000
> > to
> > 30,000 requests to our web app backed by HBase every day.
> >
> > We have a 4 node cluster running on EC2 (Large instances) and so far we
> > haven't faced any production problem.
> > (Hope it works out that way all the time!)
> >
> > Regards,
> > Vaibhav Puranik,
> > GumGum
> >
> >
> >
> >
> > On Tue, Aug 11, 2009 at 10:29 AM, Fabio Kaminski <
> > fabiokaminski@mobly.com.br
> > > wrote:
> >
> > > is there anyone with experience in hbase 0.20 realtime application,
> > > preferably in production environment?
> > >
> > > in thinking in throw away all my legacy knowledge about what i think
> > about
> > > systems.. cause i think this is the hadoop(and hbase) are the next big
> > > thing
> > > in tecnology.. i really buy this concept and im glady that i found it
> > right
> > > in it's inception.
> > >
> > > Im preparing to work with hadoop and hbase for realtime environment,
> and
> > i
> > > could see that the hbase engineers are preparing hbase for realtime
> > > applications, like rdbms standards does, but in a new and promissing
> > > environment. this is undoubtly a paradigm shift!
> > >
> > > anyone with realtime application runing in such environment? could you
> > > share
> > > some of you experience with it?
> > >
> > > Thanks !
> > >
> > > Fabio Kaminski
> > >
> >
>

Re: anyone with experience in hbase 0.20 realtime application?

Posted by Amandeep Khurana <am...@gmail.com>.
Vaibhav,

What kind of caching are you doing over hbase and how?

-Amandeep


Amandeep Khurana
Computer Science Graduate Student
University of California, Santa Cruz


On Tue, Aug 11, 2009 at 10:48 AM, Vaibhav Puranik <vp...@gmail.com>wrote:

> We are using HBase 0.20 (Trunk version at 23rd July evening) in production
> environment at GumGum.
>
> Our experience is very good. Initially I mistakenly forgot to add caching
> (even though we had planned for it) and every request was fetching two rows
> from Hbase and inserting one row in HBase.
> In spite of that our request processing time was less than 300 ms.
>
> We are not getting huge amounts of requests - we approximately get 25,000
> to
> 30,000 requests to our web app backed by HBase every day.
>
> We have a 4 node cluster running on EC2 (Large instances) and so far we
> haven't faced any production problem.
> (Hope it works out that way all the time!)
>
> Regards,
> Vaibhav Puranik,
> GumGum
>
>
>
>
> On Tue, Aug 11, 2009 at 10:29 AM, Fabio Kaminski <
> fabiokaminski@mobly.com.br
> > wrote:
>
> > is there anyone with experience in hbase 0.20 realtime application,
> > preferably in production environment?
> >
> > in thinking in throw away all my legacy knowledge about what i think
> about
> > systems.. cause i think this is the hadoop(and hbase) are the next big
> > thing
> > in tecnology.. i really buy this concept and im glady that i found it
> right
> > in it's inception.
> >
> > Im preparing to work with hadoop and hbase for realtime environment, and
> i
> > could see that the hbase engineers are preparing hbase for realtime
> > applications, like rdbms standards does, but in a new and promissing
> > environment. this is undoubtly a paradigm shift!
> >
> > anyone with realtime application runing in such environment? could you
> > share
> > some of you experience with it?
> >
> > Thanks !
> >
> > Fabio Kaminski
> >
>

Re: anyone with experience in hbase 0.20 realtime application?

Posted by Jae Joo <ja...@gmail.com>.
Would you please share how many records you have in hbase?

Jae



On Tue, Aug 11, 2009 at 1:48 PM, Vaibhav Puranik <vp...@gmail.com> wrote:

> We are using HBase 0.20 (Trunk version at 23rd July evening) in production
> environment at GumGum.
>
> Our experience is very good. Initially I mistakenly forgot to add caching
> (even though we had planned for it) and every request was fetching two rows
> from Hbase and inserting one row in HBase.
> In spite of that our request processing time was less than 300 ms.
>
> We are not getting huge amounts of requests - we approximately get 25,000
> to
> 30,000 requests to our web app backed by HBase every day.
>
> We have a 4 node cluster running on EC2 (Large instances) and so far we
> haven't faced any production problem.
> (Hope it works out that way all the time!)
>
> Regards,
> Vaibhav Puranik,
> GumGum
>
>
>
>
> On Tue, Aug 11, 2009 at 10:29 AM, Fabio Kaminski <
> fabiokaminski@mobly.com.br
> > wrote:
>
> > is there anyone with experience in hbase 0.20 realtime application,
> > preferably in production environment?
> >
> > in thinking in throw away all my legacy knowledge about what i think
> about
> > systems.. cause i think this is the hadoop(and hbase) are the next big
> > thing
> > in tecnology.. i really buy this concept and im glady that i found it
> right
> > in it's inception.
> >
> > Im preparing to work with hadoop and hbase for realtime environment, and
> i
> > could see that the hbase engineers are preparing hbase for realtime
> > applications, like rdbms standards does, but in a new and promissing
> > environment. this is undoubtly a paradigm shift!
> >
> > anyone with realtime application runing in such environment? could you
> > share
> > some of you experience with it?
> >
> > Thanks !
> >
> > Fabio Kaminski
> >
>

Re: anyone with experience in hbase 0.20 realtime application?

Posted by Ryan Rawson <ry...@gmail.com>.
If you guys move from ec2 to self-hosted hardware you should be able
to see significantly better response time.  I can easily get ~1ms
response times on reading 1 row or so from php via thrift.

-ryan

On Tue, Aug 11, 2009 at 10:48 AM, Vaibhav Puranik<vp...@gmail.com> wrote:
> We are using HBase 0.20 (Trunk version at 23rd July evening) in production
> environment at GumGum.
>
> Our experience is very good. Initially I mistakenly forgot to add caching
> (even though we had planned for it) and every request was fetching two rows
> from Hbase and inserting one row in HBase.
> In spite of that our request processing time was less than 300 ms.
>
> We are not getting huge amounts of requests - we approximately get 25,000 to
> 30,000 requests to our web app backed by HBase every day.
>
> We have a 4 node cluster running on EC2 (Large instances) and so far we
> haven't faced any production problem.
> (Hope it works out that way all the time!)
>
> Regards,
> Vaibhav Puranik,
> GumGum
>
>
>
>
> On Tue, Aug 11, 2009 at 10:29 AM, Fabio Kaminski <fabiokaminski@mobly.com.br
>> wrote:
>
>> is there anyone with experience in hbase 0.20 realtime application,
>> preferably in production environment?
>>
>> in thinking in throw away all my legacy knowledge about what i think about
>> systems.. cause i think this is the hadoop(and hbase) are the next big
>> thing
>> in tecnology.. i really buy this concept and im glady that i found it right
>> in it's inception.
>>
>> Im preparing to work with hadoop and hbase for realtime environment, and i
>> could see that the hbase engineers are preparing hbase for realtime
>> applications, like rdbms standards does, but in a new and promissing
>> environment. this is undoubtly a paradigm shift!
>>
>> anyone with realtime application runing in such environment? could you
>> share
>> some of you experience with it?
>>
>> Thanks !
>>
>> Fabio Kaminski
>>
>

Re: anyone with experience in hbase 0.20 realtime application?

Posted by Vaibhav Puranik <vp...@gmail.com>.
We are using HBase 0.20 (Trunk version at 23rd July evening) in production
environment at GumGum.

Our experience is very good. Initially I mistakenly forgot to add caching
(even though we had planned for it) and every request was fetching two rows
from Hbase and inserting one row in HBase.
In spite of that our request processing time was less than 300 ms.

We are not getting huge amounts of requests - we approximately get 25,000 to
30,000 requests to our web app backed by HBase every day.

We have a 4 node cluster running on EC2 (Large instances) and so far we
haven't faced any production problem.
(Hope it works out that way all the time!)

Regards,
Vaibhav Puranik,
GumGum




On Tue, Aug 11, 2009 at 10:29 AM, Fabio Kaminski <fabiokaminski@mobly.com.br
> wrote:

> is there anyone with experience in hbase 0.20 realtime application,
> preferably in production environment?
>
> in thinking in throw away all my legacy knowledge about what i think about
> systems.. cause i think this is the hadoop(and hbase) are the next big
> thing
> in tecnology.. i really buy this concept and im glady that i found it right
> in it's inception.
>
> Im preparing to work with hadoop and hbase for realtime environment, and i
> could see that the hbase engineers are preparing hbase for realtime
> applications, like rdbms standards does, but in a new and promissing
> environment. this is undoubtly a paradigm shift!
>
> anyone with realtime application runing in such environment? could you
> share
> some of you experience with it?
>
> Thanks !
>
> Fabio Kaminski
>

Re: anyone with experience in hbase 0.20 realtime application?

Posted by Amandeep Khurana <am...@gmail.com>.
I'm working on an app that will be feeding off hbase in real time. However,
its not in production yet. The front end/service later is yet to be
completed and tested. The initial testing of a prototype did look promising
though.

On Tue, Aug 11, 2009 at 10:29 AM, Fabio Kaminski <fabiokaminski@mobly.com.br
> wrote:

> is there anyone with experience in hbase 0.20 realtime application,
> preferably in production environment?
>
> in thinking in throw away all my legacy knowledge about what i think about
> systems.. cause i think this is the hadoop(and hbase) are the next big
> thing
> in tecnology.. i really buy this concept and im glady that i found it right
> in it's inception.
>
> Im preparing to work with hadoop and hbase for realtime environment, and i
> could see that the hbase engineers are preparing hbase for realtime
> applications, like rdbms standards does, but in a new and promissing
> environment. this is undoubtly a paradigm shift!
>
> anyone with realtime application runing in such environment? could you
> share
> some of you experience with it?
>
> Thanks !
>
> Fabio Kaminski
>