You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Dan Washusen (JIRA)" <ji...@apache.org> on 2009/12/01 02:29:21 UTC
[jira] Commented: (HBASE-1935) Scan in parallel
[ https://issues.apache.org/jira/browse/HBASE-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783985#action_12783985 ]
Dan Washusen commented on HBASE-1935:
-------------------------------------
There is a minor bug in v2 of the patch. The logic in the ParallelScannerManager to determine if a scan is interested in a region doesn't handle the case when there is only one region.
The following fixes it:
{code}
Set<HRegionInfo> regions = table.getRegionsInfo().keySet();
for (HRegionInfo region : regions) {
...
boolean isScanInterestedInRegion = (scan.getStartRow().length == 0 && scan.getStopRow().length == 0) || regions.size() == 1;
{code}
> Scan in parallel
> ----------------
>
> Key: HBASE-1935
> URL: https://issues.apache.org/jira/browse/HBASE-1935
> Project: Hadoop HBase
> Issue Type: New Feature
> Reporter: stack
> Attachments: pscanner-v2.patch, pscanner.patch
>
>
> A scanner that rather than scan in series, instead scanned multiple regions in parallell would be more involved but could complete much faster partiularly if results are sparse.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.