You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hbase.apache.org by Billy Pearson <sa...@pearsonwholesale.com> on 2008/11/09 04:52:35 UTC
hbase Partitioner for MR Jobs
Does anyone out there have any experience writing a hadoop partitioner we
need on for hbase to split the records from Map outputs
So that all records will for a region will fall in one partition would need
something that is fast as each output would have to be ran by it
Then if we setup our new TableMapReduceUtil
TableMapReduceUtil.initTableReduceJob(table, reducer, job)
to set the number of reduces to the same number of regions
we could make sure no more then one reduce task was writing
to more then one region at a time.