Posted to issues@hbase.apache.org by "stack (JIRA)" <ji...@apache.org> on 2012/11/26 20:26:58 UTC

[jira] [Updated] (HBASE-7217) Documentation: Update section 11.5.1 to recommend that hbase.regionserver.checksum.verify is set

     [ https://issues.apache.org/jira/browse/HBASE-7217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

stack updated HBASE-7217:
-------------------------

    Attachment: 7217.txt

Patch with Neil's fix. It also moves the 'HDFS Configuration' section under the 'HDFS' section that comes later.
                
> Documentation: Update section 11.5.1 to recommend that hbase.regionserver.checksum.verify is set
> ------------------------------------------------------------------------------------------------
>
>                 Key: HBASE-7217
>                 URL: https://issues.apache.org/jira/browse/HBASE-7217
>             Project: HBase
>          Issue Type: Improvement
>          Components: documentation
>    Affects Versions: 0.94.2
>            Reporter: Neil Ferguson
>            Priority: Minor
>             Fix For: 0.96.0
>
>         Attachments: 7217.txt
>
>
> When short-circuit reads are enabled (dfs.client.read.shortcircuit = true), reading with checksums enabled (dfs.client.read.shortcircuit.skip.checksum = false) follows a completely different, and much slower, path than reading with checksums disabled: BlockReaderLocal uses something called a "slow buffer", which is, unsurprisingly, slow. My tests show that this path is actually slower than reading with short-circuit reads disabled altogether.
> Therefore, I think section 11.5.1 of the HBase documentation should recommend that hbase.regionserver.checksum.verify be set to true when using short-circuit reads. I'd suggest the following:
> "For optimal performance when short-circuit reads are enabled, it is recommended that HDFS checksums be disabled. To maintain data integrity with HDFS checksums disabled, HBase can be configured to write its own checksums into its data blocks and verify against them. See Section 11.4.9, 'hbase.regionserver.checksum.verify'."
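
To make the proposed settings concrete, here is a minimal Python sketch that renders the three properties named above as a Hadoop-style <configuration> fragment. The property names come from the issue text. The value chosen for dfs.client.read.shortcircuit.skip.checksum is one reading of "HDFS checksums are disabled", and the helper names (SHORT_CIRCUIT_SETTINGS, render_site_xml, load_settings) are hypothetical. Which site file the fragment belongs in, and whether these values suit a given Hadoop/HBase version, is left as something to check against the reference guide.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# Property names are taken from the issue description. The values are an
# assumption about what the recommendation implies, not a tested setup.
SHORT_CIRCUIT_SETTINGS = {
    "dfs.client.read.shortcircuit": "true",
    # Assumed: skip HDFS-level checksums on the short-circuit read path.
    "dfs.client.read.shortcircuit.skip.checksum": "true",
    # Have HBase verify the checksums it writes into its own data blocks.
    "hbase.regionserver.checksum.verify": "true",
}


def render_site_xml(settings):
    """Render a name -> value mapping as <configuration>/<property> XML."""
    lines = ["<configuration>"]
    for name, value in settings.items():
        lines.append("  <property>")
        lines.append(f"    <name>{escape(name)}</name>")
        lines.append(f"    <value>{escape(value)}</value>")
        lines.append("  </property>")
    lines.append("</configuration>")
    return "\n".join(lines)


def load_settings(xml_text):
    """Parse a <configuration> fragment back into a name -> value dict."""
    root = ET.fromstring(xml_text)
    return {
        prop.findtext("name"): prop.findtext("value")
        for prop in root.findall("property")
    }


if __name__ == "__main__":
    print(render_site_xml(SHORT_CIRCUIT_SETTINGS))
```

Running the script prints the fragment; load_settings can be used to confirm that an edited file still carries all three properties.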

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira