You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Liyin Tang (JIRA)" <ji...@apache.org> on 2014/02/11 19:26:25 UTC

[jira] [Created] (HBASE-10502) [89-fb] ParallelScanner: a client utility to perform multiple scan requests in parallel.

Liyin Tang created HBASE-10502:
----------------------------------

             Summary: [89-fb] ParallelScanner: a client utility to perform multiple scan requests in parallel.
                 Key: HBASE-10502
                 URL: https://issues.apache.org/jira/browse/HBASE-10502
             Project: HBase
          Issue Type: New Feature
            Reporter: Liyin Tang
             Fix For: 0.89-fb


ParallelScanner is a utility class for the HBase client to perform multiple scan requests in parallel. It requires all the scan requests having the same caching size for the simplicity purpose. 
 
This class provides 3 very basic functionalities: 
* The initialize function will Initialize all the ResultScanners by calling {@link HTable#getScanner(Scan)} in parallel for each scan request.
* The next function will call the corresponding {@link ResultScanner#next(int numRows)} from each scan request in parallel, and then return all the results together as a list.  Also, if result list is empty, it indicates there is no data left for all the scanners and the user can call {@link #close()} afterwards.
* The close function will close all the scanners and shutdown the thread pool.




--
This message was sent by Atlassian JIRA
(v6.1.5#6160)