You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@couchdb.apache.org by Chris Thro <Ch...@citrix.com> on 2015/04/09 19:34:44 UTC

replication continuosly restarts

We are constantly seeing the following in the log:
[Thu, 09 Apr 2015 11:03:30 GMT] [error] [<0.105.0>] Error in replication `64f5d1684f0e22273f165de8b44893fc+continuous` (triggered by document `docs-g2w_couchdb3_las`): {checkpoint_commit_failure,<<"Error updating the source checkpoint document: conflict">>}
Restarting replication in 5 seconds.
[Thu, 09 Apr 2015 11:03:30 GMT] [error] [<0.25603.1698>] ** Generic server <0.25603.1698> terminating
** Last message in was {'$gen_cast',checkpoint}
** When Server state == {rep_state,

Replication is setup the following way:
1.dc1 <-> 1.dc2
2.dc1 <-> 2.dc2
1.dc1 <->2.dc1
1.dc2 <-> 2.dc2

We have 7 dbs replicating this way but only 5 of them are showing this problem.
Any help solving this would be greatly appreciated.

Thank you



RE: replication continuosly restarts

Posted by Chris Thro <Ch...@citrix.com>.
Thank you for the responses. The app uses a vip to connect to one of two couchdb hosts, but each host has it's own external ip/port combinatoion and replication is setup with those individual hosts/ip combinations. I don't know if we will be able to upgrade anytime soon we will have to look into that if that is the only option.

-----Original Message-----
From: Jan Lehnardt [mailto:jan@apache.org] 
Sent: Friday, April 10, 2015 4:34 AM
To: user@couchdb.apache.org
Subject: Re: replication continuosly restarts

As Alexander notes, UUIDs weren’t around in 1.2.x. Do any of the nodes happen to have the same (external) ip/port combination, maybe when ding something like IP failover?


> On 09 Apr 2015, at 22:53, Chris Thro <Ch...@citrix.com> wrote:
> 
> I get the following:
> {"error":"not_found","reason":"unknown_config_value"}
> 
> We are on version 1.2.2
> 
> -----Original Message-----
> From: Jan Lehnardt [mailto:jan@apache.org]
> Sent: Thursday, April 09, 2015 1:24 PM
> To: user@couchdb.apache.org
> Subject: Re: replication continuosly restarts
> 
> 
>> On 09 Apr 2015, at 19:34, Chris Thro <Ch...@citrix.com> wrote:
>> 
>> We are constantly seeing the following in the log:
>> [Thu, 09 Apr 2015 11:03:30 GMT] [error] [<0.105.0>] Error in 
>> replication `64f5d1684f0e22273f165de8b44893fc+continuous` (triggered by document `docs-g2w_couchdb3_las`): {checkpoint_commit_failure,<<"Error updating the source checkpoint document: conflict">>} Restarting replication in 5 seconds.
>> [Thu, 09 Apr 2015 11:03:30 GMT] [error] [<0.25603.1698>] ** Generic 
>> server <0.25603.1698> terminating
>> ** Last message in was {'$gen_cast',checkpoint}
>> ** When Server state == {rep_state,
>> 
>> Replication is setup the following way:
>> 1.dc1 <-> 1.dc2
>> 2.dc1 <-> 2.dc2
>> 1.dc1 <->2.dc1
>> 1.dc2 <-> 2.dc2
>> 
>> We have 7 dbs replicating this way but only 5 of them are showing this problem.
>> Any help solving this would be greatly appreciated.
> 
> Do any of the databases have the same value for http://127.0.0.1:5984/_config/couchdb/uuid?
> 
> Best
> Jan
> --
> Professional Support for Apache CouchDB:
> http://www.neighbourhood.ie/couchdb-support/
> 

--
Professional Support for Apache CouchDB:
http://www.neighbourhood.ie/couchdb-support/


Re: replication continuosly restarts

Posted by Jan Lehnardt <ja...@apache.org>.
As Alexander notes, UUIDs weren’t around in 1.2.x. Do any of the nodes happen to have the same (external) ip/port combination, maybe when ding something like IP failover?


> On 09 Apr 2015, at 22:53, Chris Thro <Ch...@citrix.com> wrote:
> 
> I get the following:
> {"error":"not_found","reason":"unknown_config_value"}
> 
> We are on version 1.2.2
> 
> -----Original Message-----
> From: Jan Lehnardt [mailto:jan@apache.org] 
> Sent: Thursday, April 09, 2015 1:24 PM
> To: user@couchdb.apache.org
> Subject: Re: replication continuosly restarts
> 
> 
>> On 09 Apr 2015, at 19:34, Chris Thro <Ch...@citrix.com> wrote:
>> 
>> We are constantly seeing the following in the log:
>> [Thu, 09 Apr 2015 11:03:30 GMT] [error] [<0.105.0>] Error in 
>> replication `64f5d1684f0e22273f165de8b44893fc+continuous` (triggered by document `docs-g2w_couchdb3_las`): {checkpoint_commit_failure,<<"Error updating the source checkpoint document: conflict">>} Restarting replication in 5 seconds.
>> [Thu, 09 Apr 2015 11:03:30 GMT] [error] [<0.25603.1698>] ** Generic 
>> server <0.25603.1698> terminating
>> ** Last message in was {'$gen_cast',checkpoint}
>> ** When Server state == {rep_state,
>> 
>> Replication is setup the following way:
>> 1.dc1 <-> 1.dc2
>> 2.dc1 <-> 2.dc2
>> 1.dc1 <->2.dc1
>> 1.dc2 <-> 2.dc2
>> 
>> We have 7 dbs replicating this way but only 5 of them are showing this problem.
>> Any help solving this would be greatly appreciated.
> 
> Do any of the databases have the same value for http://127.0.0.1:5984/_config/couchdb/uuid?
> 
> Best
> Jan
> --
> Professional Support for Apache CouchDB:
> http://www.neighbourhood.ie/couchdb-support/
> 

-- 
Professional Support for Apache CouchDB:
http://www.neighbourhood.ie/couchdb-support/


Re: replication continuosly restarts

Posted by Alexander Shorin <kx...@gmail.com>.
Server uuids were introduced in 1.3 release. As well, in this version
replicator was significantly improved.
Also, 1.2.2 is an ancient. Update to 1.6.1 is highly recommended.
--
,,,^..^,,,


On Thu, Apr 9, 2015 at 11:53 PM, Chris Thro <Ch...@citrix.com> wrote:
> I get the following:
> {"error":"not_found","reason":"unknown_config_value"}
>
> We are on version 1.2.2
>
> -----Original Message-----
> From: Jan Lehnardt [mailto:jan@apache.org]
> Sent: Thursday, April 09, 2015 1:24 PM
> To: user@couchdb.apache.org
> Subject: Re: replication continuosly restarts
>
>
>> On 09 Apr 2015, at 19:34, Chris Thro <Ch...@citrix.com> wrote:
>>
>> We are constantly seeing the following in the log:
>> [Thu, 09 Apr 2015 11:03:30 GMT] [error] [<0.105.0>] Error in
>> replication `64f5d1684f0e22273f165de8b44893fc+continuous` (triggered by document `docs-g2w_couchdb3_las`): {checkpoint_commit_failure,<<"Error updating the source checkpoint document: conflict">>} Restarting replication in 5 seconds.
>> [Thu, 09 Apr 2015 11:03:30 GMT] [error] [<0.25603.1698>] ** Generic
>> server <0.25603.1698> terminating
>> ** Last message in was {'$gen_cast',checkpoint}
>> ** When Server state == {rep_state,
>>
>> Replication is setup the following way:
>> 1.dc1 <-> 1.dc2
>> 2.dc1 <-> 2.dc2
>> 1.dc1 <->2.dc1
>> 1.dc2 <-> 2.dc2
>>
>> We have 7 dbs replicating this way but only 5 of them are showing this problem.
>> Any help solving this would be greatly appreciated.
>
> Do any of the databases have the same value for http://127.0.0.1:5984/_config/couchdb/uuid?
>
> Best
> Jan
> --
> Professional Support for Apache CouchDB:
> http://www.neighbourhood.ie/couchdb-support/
>

RE: replication continuosly restarts

Posted by Chris Thro <Ch...@citrix.com>.
I get the following:
{"error":"not_found","reason":"unknown_config_value"}

We are on version 1.2.2

-----Original Message-----
From: Jan Lehnardt [mailto:jan@apache.org] 
Sent: Thursday, April 09, 2015 1:24 PM
To: user@couchdb.apache.org
Subject: Re: replication continuosly restarts


> On 09 Apr 2015, at 19:34, Chris Thro <Ch...@citrix.com> wrote:
> 
> We are constantly seeing the following in the log:
> [Thu, 09 Apr 2015 11:03:30 GMT] [error] [<0.105.0>] Error in 
> replication `64f5d1684f0e22273f165de8b44893fc+continuous` (triggered by document `docs-g2w_couchdb3_las`): {checkpoint_commit_failure,<<"Error updating the source checkpoint document: conflict">>} Restarting replication in 5 seconds.
> [Thu, 09 Apr 2015 11:03:30 GMT] [error] [<0.25603.1698>] ** Generic 
> server <0.25603.1698> terminating
> ** Last message in was {'$gen_cast',checkpoint}
> ** When Server state == {rep_state,
> 
> Replication is setup the following way:
> 1.dc1 <-> 1.dc2
> 2.dc1 <-> 2.dc2
> 1.dc1 <->2.dc1
> 1.dc2 <-> 2.dc2
> 
> We have 7 dbs replicating this way but only 5 of them are showing this problem.
> Any help solving this would be greatly appreciated.

Do any of the databases have the same value for http://127.0.0.1:5984/_config/couchdb/uuid?

Best
Jan
--
Professional Support for Apache CouchDB:
http://www.neighbourhood.ie/couchdb-support/


Re: replication continuosly restarts

Posted by Jan Lehnardt <ja...@apache.org>.
> On 09 Apr 2015, at 19:34, Chris Thro <Ch...@citrix.com> wrote:
> 
> We are constantly seeing the following in the log:
> [Thu, 09 Apr 2015 11:03:30 GMT] [error] [<0.105.0>] Error in replication `64f5d1684f0e22273f165de8b44893fc+continuous` (triggered by document `docs-g2w_couchdb3_las`): {checkpoint_commit_failure,<<"Error updating the source checkpoint document: conflict">>}
> Restarting replication in 5 seconds.
> [Thu, 09 Apr 2015 11:03:30 GMT] [error] [<0.25603.1698>] ** Generic server <0.25603.1698> terminating
> ** Last message in was {'$gen_cast',checkpoint}
> ** When Server state == {rep_state,
> 
> Replication is setup the following way:
> 1.dc1 <-> 1.dc2
> 2.dc1 <-> 2.dc2
> 1.dc1 <->2.dc1
> 1.dc2 <-> 2.dc2
> 
> We have 7 dbs replicating this way but only 5 of them are showing this problem.
> Any help solving this would be greatly appreciated.

Do any of the databases have the same value for http://127.0.0.1:5984/_config/couchdb/uuid?

Best
Jan
-- 
Professional Support for Apache CouchDB:
http://www.neighbourhood.ie/couchdb-support/