Posted to issues@hbase.apache.org by "Sandy Pratt (JIRA)" <ji...@apache.org> on 2013/06/05 10:09:20 UTC

[jira] [Updated] (HBASE-8691) High-Throughput Streaming Scan API

     [ https://issues.apache.org/jira/browse/HBASE-8691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sandy Pratt updated HBASE-8691:
-------------------------------

    Attachment: StreamServletDirect.java
                StreamReceiverDirect.java
                StreamHRegionServer.java
                ScannerTest.java
                RecordReceiver.java
                README.txt
                HRegionServlet.java

See README for details on how to apply this code to an existing environment.
                
> High-Throughput Streaming Scan API
> ----------------------------------
>
>                 Key: HBASE-8691
>                 URL: https://issues.apache.org/jira/browse/HBASE-8691
>             Project: HBase
>          Issue Type: Improvement
>          Components: Scanners
>    Affects Versions: 0.95.0
>            Reporter: Sandy Pratt
>              Labels: patch, perfomance, scan
>         Attachments: HRegionServlet.java, README.txt, RecordReceiver.java, ScannerTest.java, StreamHRegionServer.java, StreamReceiverDirect.java, StreamServletDirect.java
>
>
> I've done some work testing various ways to refactor and optimize Scans in HBase, and have found that performance can be dramatically increased by adding a streaming scan API.  The attached code is a proof of concept that shows performance increases of almost 4x in some workloads.
> I'd appreciate testing, replication, and comments.  If the approach seems viable, I think such an API should be built into some future version of HBase.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira