Posted to user@hbase.apache.org by Daniel Leffel <da...@gmail.com> on 2008/05/10 22:49:13 UTC

Scanner

Is there a parallel scanner (I didn't see it in the documents)? How hard
would it be to create one that scans over regions on different servers
simultaneously? I mean, obviously, iteration order would not be
deterministic then, but that'd be ok. Would that actually make table scans
faster?

Re: Scanner

Posted by Bryan Duxbury <br...@rapleaf.com>.

The way to do parallel scanning is with a map/reduce job and
TableInputFormat. This parallelizes the scan itself, as well as
whatever processing you were doing on the rows.
-Bryan
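
To make the trade-off in the question concrete, here is a minimal Java sketch of scanning several regions at the same time. It does not use the HBase client API or TableInputFormat at all: each "region" is a hypothetical in-memory sorted map, and the class and method names (ParallelRegionScanSketch, region, scanAllRegions) are invented for illustration. It only shows that one worker per region visits every row, while the combined order across regions is no longer deterministic.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.NavigableMap;
import java.util.TreeMap;
import java.util.TreeSet;
import java.util.concurrent.Callable;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class ParallelRegionScanSketch {

    // Hypothetical stand-in for one region: a sorted slice of the table's rows.
    static NavigableMap<String, String> region(String... rowKeys) {
        NavigableMap<String, String> rows = new TreeMap<>();
        for (String key : rowKeys) {
            rows.put(key, "value-of-" + key);
        }
        return rows;
    }

    // Scans every region on its own worker thread and returns the row keys
    // in the order the workers happened to emit them.
    static List<String> scanAllRegions(List<NavigableMap<String, String>> regions) {
        ConcurrentLinkedQueue<String> seen = new ConcurrentLinkedQueue<>();
        ExecutorService pool = Executors.newFixedThreadPool(Math.max(1, regions.size()));
        try {
            List<Callable<Void>> tasks = new ArrayList<>();
            for (NavigableMap<String, String> r : regions) {
                tasks.add(() -> {
                    // Within one region the rows are still visited in key order.
                    for (String rowKey : r.keySet()) {
                        seen.add(rowKey);
                    }
                    return null;
                });
            }
            pool.invokeAll(tasks); // blocks until every region scan has finished
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("scan interrupted", e);
        } finally {
            pool.shutdown();
        }
        return new ArrayList<>(seen);
    }

    public static void main(String[] args) {
        List<String> rows = scanAllRegions(Arrays.asList(
                region("a", "b"), region("m", "n"), region("x")));
        // The interleaving across regions can differ from run to run.
        System.out.println("arrival order: " + rows);
        // Sorting afterwards recovers the key order if it is needed.
        System.out.println("sorted: " + new TreeSet<>(rows));
    }
}
```

Whether this kind of fan-out is actually faster on a real cluster depends on where the regions live and on what is done with each row; the sketch says nothing about that. A map/reduce job over TableInputFormat, as suggested above, is the approach the thread recommends.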

On May 10, 2008, at 1:49 PM, Daniel Leffel wrote:

> Is there a parallel scanner (I didn't see it in the documents)? How hard
> would it be to create one that scans over regions on different servers
> simultaneously? I mean, obviously, iteration order would not be
> deterministic then, but that'd be ok. Would that actually make table scans
> faster?