You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@hbase.apache.org by y_...@tsmc.com on 2010/01/04 08:55:42 UTC

HBase reading test

Hi,

There are 2 region servers(2G memory), 5 data nodes in my cluster.
I want to test HBase reading performance by writing a program with Hbase
client.
Inside that codes, I was using secondary index to scan the data I need,
that took 80 sec to fetch 5243 rows that was very cool!

Then I tried to deploy that program to another two machines, trying to test
hbase
ability of handling concurrent clients'reading.
Each client fetch the same data(5243 rows)
The Result is like following:
    1 concurrent client read:  80 sec
    2 concurrent client read: 104 sec
    3 concurrent client read: 232 sec
As above, increasing more concurrent client reading connections seems to
lower hbase performance too much.
Any opinions?




Fleming Chiu(邱宏明)
707-6128
y_823910@tsmc.com
週一無肉日吃素救地球(Meat Free Monday Taiwan)


 --------------------------------------------------------------------------- 
                                                         TSMC PROPERTY       
 This email communication (and any attachments) is proprietary information   
 for the sole use of its                                                     
 intended recipient. Any unauthorized review, use or distribution by anyone  
 other than the intended                                                     
 recipient is strictly prohibited.  If you are not the intended recipient,   
 please notify the sender by                                                 
 replying to this email, and then delete this email and any copies of it     
 immediately. Thank you.                                                     
 ---------------------------------------------------------------------------

Re: HBase reading test

Posted by stack <st...@duboce.net>.

My guess is that you have too little data.  Try adding 500k rows.  What is
your schema like?  What size is your data?
St.Ack

On Mon, Jan 4, 2010 at 10:44 PM, stack <st...@duboce.net> wrote:

> 2010/1/3 <y_...@tsmc.com>
>
>> Each client fetch the same data(5243 rows)
>> The Result is like following:
>>    1 concurrent client read:  80 sec
>>    2 concurrent client read: 104 sec
>>    3 concurrent client read: 232 sec
>> As above, increasing more concurrent client reading connections seems to
>> lower hbase performance too much.
>> Any opinions?
>>
>>
> Clients were all running in a single process?  If so, try running them as
> distinct processes.
> St.Ack
>

Re: HBase reading test

Posted by stack <st...@duboce.net>.

2010/1/3 <y_...@tsmc.com>

> Each client fetch the same data(5243 rows)
> The Result is like following:
>    1 concurrent client read:  80 sec
>    2 concurrent client read: 104 sec
>    3 concurrent client read: 232 sec
> As above, increasing more concurrent client reading connections seems to
> lower hbase performance too much.
> Any opinions?
>
>
Clients were all running in a single process?  If so, try running them as
distinct processes.
St.Ack