You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Andrew Purtell (JIRA)" <ji...@apache.org> on 2009/06/24 00:03:07 UTC

[jira] Commented: (HBASE-1576) TIF needs to be able to set scanner caching size for smaller row tables & performance

    [ https://issues.apache.org/jira/browse/HBASE-1576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12723327#action_12723327 ] 

Andrew Purtell commented on HBASE-1576:
---------------------------------------

The HBaseConfiguration object is created from the JobConf (TableInputFormat.java, line ~58), so isn't this sufficient?

{code}
JobConf job = new JobConf();
// ... 
job.set("hbase.client.scanner.caching", "100");
// ... 
{code}

No problem to make a convenience method, though...


> TIF needs to be able to set scanner caching size for smaller row tables & performance
> -------------------------------------------------------------------------------------
>
>                 Key: HBASE-1576
>                 URL: https://issues.apache.org/jira/browse/HBASE-1576
>             Project: Hadoop HBase
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.20.0
>            Reporter: ryan rawson
>            Priority: Critical
>             Fix For: 0.20.0
>
>
> TIF goes with the default scanner caching size (1).  When each row is processed very fast and is small, this limits the overall performance.  By setting a higher scanner caching level you can achieve 100x+ the performance with the exact same map-reduce and table.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.