Posted to user@hbase.apache.org by Rajeshkumar J <ra...@gmail.com> on 2017/05/22 09:17:23 UTC

hbase input split

Hi,

  Is there any way to customize the HBase input split based on a column value?
For instance, say I have a column family 'd' with a 'country' column
qualifier, and 'country' holds values such as USA, Australia, India, and China.
I would like all rows whose country is USA to go to one mapper, Australia
to another mapper, India to another, and likewise for all the other
values.

Thanks

Re: hbase input split

Posted by Jean-Marc Spaggiari <je...@spaggiari.org>.
Hi Rajesh,

Not really. In HBase, data is ordered and stored by row key, and the input
splits follow those key ranges. If you want to split on another field, HBase
knows nothing about the column contents, so it cannot tell where to split.

So you will run the mappers on the regular HBase splits, emit the country as
the map output key, and put your per-country logic in a reducer. (The same
approach applies with Spark.)
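
To make the pattern concrete, here is a minimal sketch in plain Python. It
only simulates the idea and does not use the HBase or Hadoop APIs. The row
keys, the sample data, and every function name below are hypothetical,
invented for illustration. Splits are cut by row key range, so each split
mixes countries; the mapper emits the country as its key, and the shuffle
brings all rows for one country to the same reducer call.

```python
from collections import defaultdict

# Hypothetical rows: (row_key, country), where country stands in for d:country.
ROWS = [
    ("row-001", "USA"),
    ("row-002", "India"),
    ("row-003", "China"),
    ("row-004", "USA"),
    ("row-005", "Australia"),
    ("row-006", "India"),
]


def key_range_splits(rows, split_size):
    """Mimic key-based input splits: sort by row key, cut into contiguous ranges."""
    ordered = sorted(rows)
    return [ordered[i:i + split_size] for i in range(0, len(ordered), split_size)]


def map_split(split):
    """Mapper: runs on one split and emits (country, row_key) pairs."""
    for row_key, country in split:
        yield country, row_key


def shuffle(mapped_pairs):
    """Shuffle: group mapper output by key so each country reaches one reducer call."""
    groups = defaultdict(list)
    for country, row_key in mapped_pairs:
        groups[country].append(row_key)
    return groups


def reduce_country(country, row_keys):
    """Reducer: the per-country logic goes here; this one just counts rows."""
    return country, len(row_keys)


splits = key_range_splits(ROWS, split_size=3)
mapped = [pair for split in splits for pair in map_split(split)]
results = dict(reduce_country(c, keys) for c, keys in shuffle(mapped).items())
print(results)
```

In this sketch the first split contains USA, India, and China rows together,
which is why the grouping has to happen after the map phase rather than in
the split itself.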

JMS
