You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hbase.apache.org by Rajeshkumar J <ra...@gmail.com> on 2017/05/12 09:08:35 UTC

mapreduce over hbase table

Hi,

   I am running a mapreduce job over hbase table. I went through some blogs
and in that it is mentioned that data locality will be achieved for this
job other than some cases like region reassignment and others. When I run
mapreduce job over hbase table it takes each region as input for each
mapper. My question is how this job knows which region exists where? can
anyone explain this flow

Thanks

Re: mapreduce over hbase table

Posted by Ted Yu <yu...@gmail.com>.
Please take a look at the getSplits() methods in:

hbase-server/src/main/java/org/apache/hadoop/hbase/mapreduce/TableInputFormat.java
hbase-server/src/main/java/org/apache/hadoop/hbase/mapred/TableInputFormatBase.java

Cheers

On Fri, May 12, 2017 at 2:08 AM, Rajeshkumar J <ra...@gmail.com>
wrote:

> Hi,
>
>    I am running a mapreduce job over hbase table. I went through some blogs
> and in that it is mentioned that data locality will be achieved for this
> job other than some cases like region reassignment and others. When I run
> mapreduce job over hbase table it takes each region as input for each
> mapper. My question is how this job knows which region exists where? can
> anyone explain this flow
>
> Thanks
>