Posted to commits@cassandra.apache.org by "Jonathan Ellis (JIRA)" <ji...@apache.org> on 2013/12/06 18:15:35 UTC

[jira] [Updated] (CASSANDRA-4165) Generate Digest file for compressed SSTables

     [ https://issues.apache.org/jira/browse/CASSANDRA-4165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Ellis updated CASSANDRA-4165:
--------------------------------------

      Component/s: Core
    Fix Version/s:     (was: 2.0.4)
                   2.1

It turns out that uncompressed writes spend about 20% of their time computing the SHA-1 digest.

I think we should switch to Adler-32 here as well as at the block level.  (I note that Adler-capable command-line tools exist for RHEL and Debian, as well as for the scripting languages Perl, Python, and Ruby.)
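
For anyone verifying backups from a script, here is a minimal sketch of how both checksums could be computed over a data file in Python. It is not Cassandra's implementation: the function name file_checksums and the chunk size are illustrative assumptions, and it uses only the standard zlib and hashlib modules.

```python
import hashlib
import zlib


def file_checksums(path, chunk_size=64 * 1024):
    """Return (adler32, sha1_hex) for the file at `path`, read in chunks.

    Hypothetical helper for illustration only; not part of Cassandra.
    """
    adler = 1  # Adler-32 starting value used by zlib
    sha1 = hashlib.sha1()
    with open(path, "rb") as f:
        # Read fixed-size chunks until EOF so large files are never
        # loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            adler = zlib.adler32(chunk, adler)  # running Adler-32
            sha1.update(chunk)                  # running SHA-1
    # Mask to an unsigned 32-bit value for a stable representation.
    return adler & 0xFFFFFFFF, sha1.hexdigest()
```

Both values are updated incrementally, so the result should match a single-pass checksum of the whole file regardless of the chunk size chosen.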

(We still need to avoid computing the redundant CRC component for compressed files, too.)

/cc [~pkolaczk] [~benedict]

> Generate Digest file for compressed SSTables
> --------------------------------------------
>
>                 Key: CASSANDRA-4165
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4165
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Marcus Eriksson
>            Assignee: Marcus Eriksson
>            Priority: Minor
>              Labels: performance
>             Fix For: 2.1
>
>         Attachments: 0001-Generate-digest-for-compressed-files-as-well.patch, 4165-rebased.txt
>
>
> We use the generated *Digest.sha1 files to verify backups; it would be nice if they were generated for compressed SSTables as well.



--
This message was sent by Atlassian JIRA
(v6.1#6144)