You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hive.apache.org by "Daniel,Wu" <ha...@163.com> on 2011/08/08 15:48:22 UTC

what's the benifit of integrate hbase with hive? For low latency?

  Hive document said hive is high latency, to query a table with about 100M might take 1 minute.  And hbase is a high performance database, so does that mean after integrate hive and hbase,  hive will get a better performance with lower latency?

Re: what's the benifit of integrate hbase with hive? For low latency?

Posted by Carl Steinbach <ca...@cloudera.com>.
Hi Daniel,

Integrating Hive with HBase does not solve the query latency problem, but it
does allow you to
run Hive queries against data which is being continuously updated --
something which is not
possible with Hive alone.

The following documents do a good job of discussing the use-cases that are
supported by the
Hive HBaseHandler:

http://www.cloudera.com/blog/2010/06/integrating-hive-and-hbase/

http://www.slideshare.net/hadoopusergroup/hive-h-basehadoopapr2010(particularly
slides 5-7)

Thanks.

Carl

2011/8/8 Daniel,Wu <ha...@163.com>

>   Hive document said hive is high latency, to query a table with about 100M
> might take 1 minute.  And hbase is a high performance database, so does that
> mean after integrate hive and hbase,  hive will get a better performance
> with lower latency?
>
>
>

RE: what's the benifit of integrate hbase with hive? For low latency?

Posted by "Aggarwal, Vaibhav" <va...@amazon.com>.
There are many potential benefits of using hive hbase handler.


1.       The most obvious is ability to run SQL like queries on your data instead of using hbase client.

2.       Ability to join data with other data sources like HDFS or S3.

3.       Ability to move data from your Hive tables into Hbase tables.

Thanks
Vaibhav

From: Daniel,Wu [mailto:hadoop_wu@163.com]
Sent: Monday, August 08, 2011 6:48 AM
To: user@hive.apache.org
Subject: what's the benifit of integrate hbase with hive? For low latency?

  Hive document said hive is high latency, to query a table with about 100M might take 1 minute.  And hbase is a high performance database, so does that mean after integrate hive and hbase,  hive will get a better performance with lower latency?