Posted to commits@cassandra.apache.org by "Jonathan Ellis (JIRA)" <ji...@apache.org> on 2010/06/17 08:12:28 UTC

[jira] Commented: (CASSANDRA-579) Stream SSTables without Anti-compaction

    [ https://issues.apache.org/jira/browse/CASSANDRA-579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12879683#action_12879683 ] 

Jonathan Ellis commented on CASSANDRA-579:
------------------------------------------

        // FIXME: extremely rough estimate of keys in the sstable

I think we do need a better key count estimate.  What about reading the first 1% (by size) of the rows, and estimating from that?

Alternatively, stream-in could actually count the rows as they arrive.
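
A minimal sketch of the sampling idea, assuming rows can be read sequentially from the start of the data file and that each row's on-disk size is known once it has been read. The class and method names (KeyCountEstimate, estimate) and the samplePercent parameter are hypothetical and are not part of the Cassandra code base or the attached patches; the row source is simulated with an iterator of sizes.

```java
import java.util.Collections;
import java.util.Iterator;

public class KeyCountEstimate {
    /**
     * Estimates the number of rows in a data file by reading rows from the
     * start until roughly samplePercent of the file (by size) has been
     * covered, then scaling the observed row count up to the whole file.
     *
     * rowSizes      on-disk size in bytes of each row, in file order (simulated input)
     * totalBytes    total size of the data file in bytes
     * samplePercent share of the file to sample, e.g. 1 for the first 1%
     */
    static long estimate(Iterator<Long> rowSizes, long totalBytes, int samplePercent) {
        long budget = totalBytes * samplePercent / 100;
        long bytesRead = 0;
        long rowsRead = 0;

        // Read whole rows until the sampled bytes reach the budget.
        while (rowSizes.hasNext() && bytesRead < budget) {
            bytesRead += rowSizes.next();
            rowsRead++;
        }
        if (bytesRead == 0) {
            return 0; // nothing sampled, so nothing to extrapolate from
        }
        // Assume the sampled rows are representative of the rest of the file.
        return Math.round((double) rowsRead * totalBytes / bytesRead);
    }

    public static void main(String[] args) {
        // 1000 rows of 100 bytes each; only the first 1% of the file is read.
        Iterator<Long> sizes = Collections.nCopies(1000, 100L).iterator();
        System.out.println(estimate(sizes, 100_000L, 1));
    }
}
```

The estimate is only as good as the assumption that row sizes in the first 1% resemble the rest of the file. Counting rows as they arrive avoids that assumption, at the cost of not knowing the count until the stream has finished.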

> Stream SSTables without Anti-compaction
> ---------------------------------------
>
>                 Key: CASSANDRA-579
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-579
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Stu Hood
>            Assignee: Stu Hood
>            Priority: Critical
>             Fix For: 0.7
>
>         Attachments: 0001-Extract-index-filter-writing-into-IndexWriter.patch, 0002-Add-recovery-for-non-essential-sstable-components.patch, 0003-Only-send-the-datafile-when-streaming.patch, 0004-Stream-minimal-sections-of-SSTables-without-compacti.patch, 0005-Add-greater-than-operation-for-sstable-indexes-to-op.patch
>
>
> The io.Streaming API currently requires a file on disk to stream, which means that bootstrap and repairs need to perform an anti-compaction that writes a bunch of data to disk, only to have it be deleted after the streaming has finished.
> EDIT: Deleted reference to using streaming as a client API: it wouldn't provide enough benefit over using the BMT interface, due to fragility.
