You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Sandy Pratt (JIRA)" <ji...@apache.org> on 2013/06/05 10:09:20 UTC

[jira] [Created] (HBASE-8691) High-Throuhput Streaming Scan API

Sandy Pratt created HBASE-8691:
----------------------------------

             Summary: High-Throuhput Streaming Scan API
                 Key: HBASE-8691
                 URL: https://issues.apache.org/jira/browse/HBASE-8691
             Project: HBase
          Issue Type: Improvement
          Components: Scanners
    Affects Versions: 0.95.0
            Reporter: Sandy Pratt
         Attachments: HRegionServlet.java, README.txt, RecordReceiver.java, ScannerTest.java, StreamHRegionServer.java, StreamReceiverDirect.java, StreamServletDirect.java

I've done some working testing various ways to refactor and optimize Scans in HBase, and have found that performance can be dramatically increased by the addition of a streaming scan API.  The attached code constitutes a proof of concept that shows performance increases of almost 4x in some workloads.

I'd appreciate testing, replication, and comments.  If the approach seems viable, I think such an API should be built into some future version of HBase.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira