Posted to commits@cassandra.apache.org by "Yuki Morishita (JIRA)" <ji...@apache.org> on 2013/06/18 04:55:22 UTC

[jira] [Reopened] (CASSANDRA-5555) Allow sstableloader to handle a larger number of files

     [ https://issues.apache.org/jira/browse/CASSANDRA-5555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yuki Morishita reopened CASSANDRA-5555:
---------------------------------------


This fix can send a wrong "estimated number of keys", which is used to create the bloom filter (BF) on the streamed node, because the estimate is calculated from the index summary.

My proposed fix is to make the index summary completely optional. That is, when the Summary.db file is present, we load it and use it. We also add an option not to load Summary.db. When the file is not present, or the user chooses not to load the summary, we simply scan the index file (Index.db) sequentially to get the "estimated number of keys".
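A rough sketch of that fallback logic, for illustration only. Everything here is hypothetical and not taken from the attached patches: the class name KeyEstimateSketch, the simplified Index.db entry layout (2-byte key length, key bytes, 8-byte data position), the one-int Summary.db layout (a count of sampled entries), and the sampling interval of 128.

```java
import java.io.BufferedInputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.EOFException;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical sketch; names and file layouts are assumptions, not Cassandra's real ones.
class KeyEstimateSketch {
    // Assumed sampling interval: one summary entry per this many index entries.
    static final int INDEX_INTERVAL = 128;

    // Use the summary when it exists and loading is enabled; otherwise scan the index.
    static long estimateKeys(Path summaryFile, Path indexFile, boolean loadSummary)
            throws IOException {
        if (loadSummary && Files.exists(summaryFile)) {
            try (DataInputStream in = new DataInputStream(Files.newInputStream(summaryFile))) {
                // Assumed summary layout: a single int holding the sampled entry count.
                return (long) in.readInt() * INDEX_INTERVAL;
            }
        }
        return countByScanningIndex(indexFile);
    }

    // Sequentially reads every index entry and counts them.
    // Assumed entry layout: [unsigned short keyLength][key bytes][long dataPosition].
    static long countByScanningIndex(Path indexFile) throws IOException {
        long keys = 0;
        try (DataInputStream in = new DataInputStream(
                new BufferedInputStream(Files.newInputStream(indexFile)))) {
            while (true) {
                int keyLength;
                try {
                    keyLength = in.readUnsignedShort();
                } catch (EOFException endOfFile) {
                    break; // no further entry starts here
                }
                in.readFully(new byte[keyLength]); // skip the key bytes
                in.readLong();                     // skip the data file position
                keys++;
            }
        }
        return keys;
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("sstable-sketch");
        Path index = dir.resolve("Index.db");
        try (DataOutputStream out = new DataOutputStream(Files.newOutputStream(index))) {
            for (String key : new String[] {"a", "bb", "ccc"}) {
                byte[] bytes = key.getBytes(StandardCharsets.UTF_8);
                out.writeShort(bytes.length);
                out.write(bytes);
                out.writeLong(0L);
            }
        }
        // No Summary.db exists in dir, so the count comes from the sequential scan.
        System.out.println(estimateKeys(dir.resolve("Summary.db"), index, true));
    }
}
```

In this sketch the scan gives an exact count at the cost of reading the whole index file, while the summary path only reads a few bytes but is accurate to within one sampling interval at best.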


                
> Allow sstableloader to handle a larger number of files
> ------------------------------------------------------
>
>                 Key: CASSANDRA-5555
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5555
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core, Tools
>            Reporter: Tyler Hobbs
>            Assignee: Jonathan Ellis
>             Fix For: 1.2.6
>
>         Attachments: 5555-01.txt, 5555-02.txt, CASSANDRA-5555.txt, CASSANDRA-5555.txt
>
>
> With the default heap size, sstableloader will OOM when there are roughly 25k files in the directory to load.  It's easy to reach this number of files in a single LCS column family.
> By avoiding creating all SSTableReaders up front in SSTableLoader, we should be able to increase the number of files that sstableloader can handle considerably.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira