Posted to user@hive.apache.org by Manjula mohapatra <ma...@gmail.com> on 2014/02/20 10:47:28 UTC

Tuning Hive queries that use an underlying HBase table

I am querying a Hive table (mapped to an HBase table).

What techniques are there for tuning the Hive query and avoiding HBase scans?

The query uses multiple SPLIT and SUBSTR functions and a WHERE condition,
something like:

select col1, col2, ..., count(*)
from hiveTable
where split(col1)[0] > timestamp1 and split(col1)[0] < timestamp2
group by ....