You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Jonathan Ellis (JIRA)" <ji...@apache.org> on 2010/06/17 08:12:28 UTC
[jira] Commented: (CASSANDRA-579) Stream SSTables without
Anti-compaction
[ https://issues.apache.org/jira/browse/CASSANDRA-579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12879683#action_12879683 ]
Jonathan Ellis commented on CASSANDRA-579:
------------------------------------------
// FIXME: extremely rough estimate of keys in the sstable
I think we do need a better key count estimate. What about reading the first 1% (by size) of the rows, and estimating from that?
Alternatively streamin could actually count the rows as they arrive.
> Stream SSTables without Anti-compaction
> ---------------------------------------
>
> Key: CASSANDRA-579
> URL: https://issues.apache.org/jira/browse/CASSANDRA-579
> Project: Cassandra
> Issue Type: Improvement
> Reporter: Stu Hood
> Assignee: Stu Hood
> Priority: Critical
> Fix For: 0.7
>
> Attachments: 0001-Extract-index-filter-writing-into-IndexWriter.patch, 0002-Add-recovery-for-non-essential-sstable-components.patch, 0003-Only-send-the-datafile-when-streaming.patch, 0004-Stream-minimal-sections-of-SSTables-without-compacti.patch, 0005-Add-greater-than-operation-for-sstable-indexes-to-op.patch
>
>
> The io.Streaming API currently requires a file on disk to stream, which means that bootstrap and repairs need to perform an anti-compaction that writes a bunch of data to disk, only to have it be deleted after the streaming has finished.
> EDIT: Deleted reference to using streaming as a client API: it wouldn't provide enough benefit over using the BMT interface, due to fragility.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.