Posted to dev@hbase.apache.org by Pranav Khaitan <pr...@gmail.com> on 2010/10/03 11:14:43 UTC

Re: Review Request: Reseeking directly to required columns

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
http://review.cloudera.org/r/781/#review1382
-----------------------------------------------------------



trunk/src/main/java/org/apache/hadoop/hbase/regionserver/ScanQueryMatcher.java
<http://review.cloudera.org/r/781/#comment4769>

    Kannan, what you are suggesting is the right thing to do, but I don't think it is needed here. This function is called only when the ScanQueryMatcher decides to move to the next column (and not the next row). If the ScanQueryMatcher knows that no required columns remain, it returns SEEK_NEXT_ROW instead of SEEK_NEXT_COL, and this function is never called. Therefore, nextColumn will be null only when we are using WildcardColumnTracker and have no explicit column information.
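
    To make the argument concrete, here is a rough, self-contained sketch of the control flow described above. It is not the code in the patch: the class ReseekSketch, the simplified ColumnTracker interface, the methods decide and seekTarget, and the string "row/column" seek keys are all hypothetical stand-ins chosen for illustration. Only the names SEEK_NEXT_COL, SEEK_NEXT_ROW, nextColumn and the explicit/wildcard tracker distinction come from the discussion.

```java
import java.util.Arrays;
import java.util.List;

// Illustrative model only; none of these types are the real HBase classes.
public class ReseekSketch {
    enum MatchCode { SEEK_NEXT_COL, SEEK_NEXT_ROW }

    // Hypothetical, simplified tracker interface (assumption, not the HBase API).
    interface ColumnTracker {
        // Next wanted column after `current`, or null if it is not known.
        String nextColumn(String current);
        // True when the tracker knows no wanted column follows `current`.
        boolean exhausted(String current);
    }

    // Tracker for a scan that names its columns explicitly.
    static final class ExplicitTracker implements ColumnTracker {
        private final List<String> wanted; // assumed sorted ascending

        ExplicitTracker(String... cols) { this.wanted = Arrays.asList(cols); }

        public String nextColumn(String current) {
            for (String c : wanted) {
                if (c.compareTo(current) > 0) return c;
            }
            return null;
        }

        public boolean exhausted(String current) { return nextColumn(current) == null; }
    }

    // Tracker for a scan over all columns: it never knows the next column.
    static final class WildcardTracker implements ColumnTracker {
        public String nextColumn(String current) { return null; }
        public boolean exhausted(String current) { return false; }
    }

    // The matcher's decision once it is finished with `current`.
    static MatchCode decide(ColumnTracker t, String current) {
        return t.exhausted(current) ? MatchCode.SEEK_NEXT_ROW : MatchCode.SEEK_NEXT_COL;
    }

    // Only reached after decide() returned SEEK_NEXT_COL, so a null nextColumn
    // can come only from the wildcard tracker.
    static String seekTarget(ColumnTracker t, String row, String current) {
        String next = t.nextColumn(current);
        if (next == null) {
            // No explicit column info: seek just past the current column.
            return row + "/" + current + "/<end>";
        }
        // Explicit column info: jump directly to the next required column.
        return row + "/" + next;
    }

    public static void main(String[] args) {
        ColumnTracker explicit = new ExplicitTracker("c1", "c5");
        System.out.println(decide(explicit, "c1"));                        // SEEK_NEXT_COL
        System.out.println(seekTarget(explicit, "r1", "c1"));              // r1/c5
        System.out.println(decide(explicit, "c5"));                        // SEEK_NEXT_ROW
        System.out.println(seekTarget(new WildcardTracker(), "r1", "c1")); // r1/c1/<end>
    }
}
```

    In this model the explicit tracker never reaches seekTarget with a null nextColumn, because decide has already returned SEEK_NEXT_ROW in that case. An extra null check would still be harmless as a defensive measure.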


- Pranav


On 2010-09-16 00:57:12, Pranav Khaitan wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> http://review.cloudera.org/r/781/
> -----------------------------------------------------------
> 
> (Updated 2010-09-16 00:57:12)
> 
> 
> Review request for hbase, stack, Jonathan Gray, Karthik Ranganathan, and Kannan Muthukkaruppan.
> 
> 
> Summary
> -------
> 
> Optimize reads for specific columns by reseeking between scans. Use the reseek logic to jump directly to the next required column rather than reading the current column.
> 
> Big performance gain for queries with sparse columns. Not advantageous for dense ones. Consider this before committing.
> 
> Further suggestions/questions are welcome!
> 
> 
> This addresses bugs HBASE-2450, HBASE-2916 and HBASE-2959.
>     http://issues.apache.org/jira/browse/HBASE-2450
>     http://issues.apache.org/jira/browse/HBASE-2916
>     http://issues.apache.org/jira/browse/HBASE-2959
> 
> 
> Diffs
> -----
> 
>   trunk/src/main/java/org/apache/hadoop/hbase/HConstants.java 990674 
>   trunk/src/main/java/org/apache/hadoop/hbase/KeyValue.java 990674 
>   trunk/src/main/java/org/apache/hadoop/hbase/regionserver/ScanQueryMatcher.java 990674 
>   trunk/src/main/java/org/apache/hadoop/hbase/regionserver/StoreScanner.java 990674 
> 
> Diff: http://review.cloudera.org/r/781/diff
> 
> 
> Testing
> -------
> 
> All existing tests pass and make significant use of this code. 
> 
> Added a new test file called TestColumnSeeking along with another patch at https://review.cloudera.org/r/780/.
> 
> 
> Thanks,
> 
> Pranav
> 
>