You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@cassandra.apache.org by Robert Coli <rc...@eventbrite.com> on 2014/01/30 20:41:00 UTC
Re: Restoring keyspace using snapshots
On Wed, Jan 29, 2014 at 9:45 PM, Senthil, Athinanthny X. -ND <
Athinanthny.X.Senthil.-ND@disney.com> wrote:
> Plan to backup and restore keyspace from PROD to PRE-PROD cluster which
> has same number of nodes. Keyspace will have few hundred millions of rows.
> We need to do this every other week. Which one of the below options most
> time-efficient and puts less stress on target cluster ? We want to finish
> backup and restore in low usage time window.
>
http://www.palominodb.com/blog/2012/09/25/bulk-loading-options-cassandra
Has some details on when each approach may be better or worse. In your
case, you should probably just do the "copy-the-sstables" method. If the
target cluster has the same number of nodes, just assign it the same tokens
and then just copy SSTables from SOURCE_NODE_A to TARGET_NODE_A and so on.
If you do that, you don't even have to run cleanup, because no nodes have
changed their range ownership.
Don't use refresh if you don't need to, just (coalesce the target cluster,
load schema and then) copy the SSTables into the dir with the node down,
and then start it.
Refresh's current design is unsafe :
https://issues.apache.org/jira/browse/CASSANDRA-6245
https://issues.apache.org/jira/browse/CASSANDRA-6514
=Rob