You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@cassandra.apache.org by David Koblas <da...@koblas.com> on 2012/03/05 16:42:42 UTC

Adding a second datacenter

Everything that I've read about data centers focuses on setting things 
up at the beginning of time.

I've the the following situation:

10 machines in a datacenter (DC1), with replication factor of 2.

I want to set up a second data center (DC2) with the following 
configuration:
   20 machines with a replication factor of 4

What I've found is that if I initially start adding things, the first 
machine to join the network attempts to replicate all of the data from 
DC1 and fills up it's disk drive.  I've played with setting the 
storage_options to have a replication factor of 0, then I can bring up 
all 20 machines in DC2 but then start getting a huge number of read 
errors from read on DC1.

Is there a simple cookbook on how to add a second DC?  I'm currently 
trying to set the replication factor to 1 and do a repair, but that 
doesn't feel like the right approach.

Thanks,

Re: Adding a second datacenter

Posted by Rob Coli <rc...@palominodb.com>.

On Tue, Apr 24, 2012 at 3:24 PM, Bill Au <bi...@gmail.com> wrote:
> Everything went smoothly until I ran the last step, which is to run nodetool
> repair on all the nodes in the new data center.  Repair is hanging on all
> the new nodes.  I had to hit control-C to break out of it.
> [ snip ]
> Did I missed anything or did something wrong?  How do I recover from this?

http://wiki.apache.org/cassandra/Operations
"
Running nodetool repair: Like all nodetool operations in 0.7, repair
is blocking: it will wait for the repair to finish and then exit. This
may take a long time on large data sets.
"

Since 0.7, all nodetool operations are blocking. While "repair" does
in fact have bugs which make it possible that it will hang in all
extant release versions, the fact that nodetool repair (hopefully you
were using -pr option?) takes a long time to return does not indicate
that it is hanging.

If you see repair and AES messages in system.log, it is probably not
in fact hung. If you don't see said messages for a long time, it might
be hung, in which case the only remedy currently available to you is
to restart the affected nodes.

=Rob
PS - I know this is a reply on a relatively old thread and I think you
maybe received assistance on another thread after this one. If so,
apologies!

-- 
=Robert Coli
AIM&GTALK - rcoli@palominodb.com
YAHOO - rcoli.palominob
SKYPE - rcoli_palominodb

Re: Adding a second datacenter

Posted by Bill Au <bi...@gmail.com>.

I just followed the step outlined in this email thread to add a second data
center to my existing cluster.  I am running 1.0.8.  Each data center has a
replication factor of 2.  I am using local quorum for read and write.

Everything went smoothly until I ran the last step, which is to run
nodetool repair on all the nodes in the new data center.  Repair is hanging
on all the new nodes.  I had to hit control-C to break out of it.  I do see
repair and AntiEntropySession messages in system.log.  I also see some
"Received merkle tree" message, and it does look like some data is being
added to the new node.  But repair never returns and there are no new
messages in system.log.

I ran repair a second time but it is still hanging.

Did I missed anything or did something wrong?  How do I recover from this?

Bill

On Mon, Mar 5, 2012 at 11:34 AM, David Koblas <da...@koblas.com> wrote:

> Jeremiah,
>
> Thanks!
>
> I'm running 1.0.8, two interesting things to note:
>
> - I don't have sufficient disk space to handle the straight bump to a
> replication factor of 4, so I think I'm going to have to do it one by one
> (1,2,3 and 4) with a bunch of cleanups in between.
>
> - Also, using a LOCAL_QUORUM doesn't work since my application has a hard
> response time limit then my read speed ends up being the speed of the
> slowest node.  What I want is LOCAL_ONE which doesn't exist in the API
> (unless I missed something).
>
> Yes, CASSANDRA-3483 is really what I'm looking for.
>
> --david
>
>
> On 3/5/12 8:02 AM, Jeremiah Jordan wrote:
>
>> You need to make sure your clients are reading using LOCAL_* settings so
>> that they don't try to get data from the other data center.  But you
>> shouldn't get errors while replication_factor is 0.  Once you change the
>> replication factor to 4, you should get missing data if you are using
>> LOCAL_* for reading.
>>
>> What version are you using?
>>
>> See the IRC logs at the begining of this JIRA discussion thread for some
>> info:
>>
>> https://issues.apache.org/**jira/browse/CASSANDRA-3483<https://issues.apache.org/jira/browse/CASSANDRA-3483>
>>
>> But you should be able to:
>> 1. Set dc2:0 in the replication_factor.
>> 2. Set bootstrap to false on the new nodes.
>> 2. Start all of the new nodes.
>> 3. Change replication_factor to dc2:4
>> 4. run repair on the nodes in dc2.
>>
>> Once the repairs finish you should be able to start using DC2.  You are
>> still going to need a bunch of extra space because the repair is going to
>> get you a couple copies of the data.
>>
>> Once 1.1 comes out it will have new nodetool commands for making this a
>> little nicer per CASSANDRA-3483
>>
>> -Jeremiah
>>
>>
>> On 03/05/2012 09:42 AM, David Koblas wrote:
>>
>>> Everything that I've read about data centers focuses on setting things
>>> up at the beginning of time.
>>>
>>> I've the the following situation:
>>>
>>> 10 machines in a datacenter (DC1), with replication factor of 2.
>>>
>>> I want to set up a second data center (DC2) with the following
>>> configuration:
>>>  20 machines with a replication factor of 4
>>>
>>> What I've found is that if I initially start adding things, the first
>>> machine to join the network attempts to replicate all of the data from DC1
>>> and fills up it's disk drive.  I've played with setting the storage_options
>>> to have a replication factor of 0, then I can bring up all 20 machines in
>>> DC2 but then start getting a huge number of read errors from read on DC1.
>>>
>>> Is there a simple cookbook on how to add a second DC?  I'm currently
>>> trying to set the replication factor to 1 and do a repair, but that doesn't
>>> feel like the right approach.
>>>
>>> Thanks,
>>>
>>>
>>>
>>>

Re: Adding a second datacenter

Posted by David Koblas <da...@koblas.com>.

Jeremiah,

Thanks!

I'm running 1.0.8, two interesting things to note:

- I don't have sufficient disk space to handle the straight bump to a 
replication factor of 4, so I think I'm going to have to do it one by 
one (1,2,3 and 4) with a bunch of cleanups in between.

- Also, using a LOCAL_QUORUM doesn't work since my application has a 
hard response time limit then my read speed ends up being the speed of 
the slowest node.  What I want is LOCAL_ONE which doesn't exist in the 
API (unless I missed something).

Yes, CASSANDRA-3483 is really what I'm looking for.

--david

On 3/5/12 8:02 AM, Jeremiah Jordan wrote:
> You need to make sure your clients are reading using LOCAL_* settings 
> so that they don't try to get data from the other data center.  But 
> you shouldn't get errors while replication_factor is 0.  Once you 
> change the replication factor to 4, you should get missing data if you 
> are using LOCAL_* for reading.
>
> What version are you using?
>
> See the IRC logs at the begining of this JIRA discussion thread for 
> some info:
>
> https://issues.apache.org/jira/browse/CASSANDRA-3483
>
> But you should be able to:
> 1. Set dc2:0 in the replication_factor.
> 2. Set bootstrap to false on the new nodes.
> 2. Start all of the new nodes.
> 3. Change replication_factor to dc2:4
> 4. run repair on the nodes in dc2.
>
> Once the repairs finish you should be able to start using DC2.  You 
> are still going to need a bunch of extra space because the repair is 
> going to get you a couple copies of the data.
>
> Once 1.1 comes out it will have new nodetool commands for making this 
> a little nicer per CASSANDRA-3483
>
> -Jeremiah
>
>
> On 03/05/2012 09:42 AM, David Koblas wrote:
>> Everything that I've read about data centers focuses on setting 
>> things up at the beginning of time.
>>
>> I've the the following situation:
>>
>> 10 machines in a datacenter (DC1), with replication factor of 2.
>>
>> I want to set up a second data center (DC2) with the following 
>> configuration:
>>   20 machines with a replication factor of 4
>>
>> What I've found is that if I initially start adding things, the first 
>> machine to join the network attempts to replicate all of the data 
>> from DC1 and fills up it's disk drive.  I've played with setting the 
>> storage_options to have a replication factor of 0, then I can bring 
>> up all 20 machines in DC2 but then start getting a huge number of 
>> read errors from read on DC1.
>>
>> Is there a simple cookbook on how to add a second DC?  I'm currently 
>> trying to set the replication factor to 1 and do a repair, but that 
>> doesn't feel like the right approach.
>>
>> Thanks,
>>
>>
>>

Re: Adding a second datacenter

Posted by Jeremiah Jordan <je...@morningstar.com>.

You need to make sure your clients are reading using LOCAL_* settings so 
that they don't try to get data from the other data center.  But you 
shouldn't get errors while replication_factor is 0.  Once you change the 
replication factor to 4, you should get missing data if you are using 
LOCAL_* for reading.

What version are you using?

See the IRC logs at the begining of this JIRA discussion thread for some 
info:

https://issues.apache.org/jira/browse/CASSANDRA-3483

But you should be able to:
1. Set dc2:0 in the replication_factor.
2. Set bootstrap to false on the new nodes.
2. Start all of the new nodes.
3. Change replication_factor to dc2:4
4. run repair on the nodes in dc2.

Once the repairs finish you should be able to start using DC2.  You are 
still going to need a bunch of extra space because the repair is going 
to get you a couple copies of the data.

Once 1.1 comes out it will have new nodetool commands for making this a 
little nicer per CASSANDRA-3483

-Jeremiah

On 03/05/2012 09:42 AM, David Koblas wrote:
> Everything that I've read about data centers focuses on setting things 
> up at the beginning of time.
>
> I've the the following situation:
>
> 10 machines in a datacenter (DC1), with replication factor of 2.
>
> I want to set up a second data center (DC2) with the following 
> configuration:
>   20 machines with a replication factor of 4
>
> What I've found is that if I initially start adding things, the first 
> machine to join the network attempts to replicate all of the data from 
> DC1 and fills up it's disk drive.  I've played with setting the 
> storage_options to have a replication factor of 0, then I can bring up 
> all 20 machines in DC2 but then start getting a huge number of read 
> errors from read on DC1.
>
> Is there a simple cookbook on how to add a second DC?  I'm currently 
> trying to set the replication factor to 1 and do a repair, but that 
> doesn't feel like the right approach.
>
> Thanks,
>
>
>