You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@curator.apache.org by "Jordan Zimmerman (JIRA)" <ji...@apache.org> on 2013/07/13 00:11:53 UTC

[jira] [Comment Edited] (CURATOR-47) Two leaders in the leader election recipe

    [ https://issues.apache.org/jira/browse/CURATOR-47?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13707449#comment-13707449 ] 

Jordan Zimmerman edited comment on CURATOR-47 at 7/12/13 10:11 PM:
-------------------------------------------------------------------

I ran the test for a while (without the InstanceSpec changes) and was not able to reproduce (see log below). If you post your InstanceSpec changes I'll try those. BTW, I recently started an email thread on user@ZK titled "rolling config change considered harmful". Please have a look:

http://search-hadoop.com/m/6hmDl1xZTxY1&subj=Re+Rolling+config+change+considered+harmful+

t002043: Starting test.
t002057: Stopping server 0 iteration 0
t002905: 0 takes leadership. Leader was:-1
t006030: Stopping server 1 iteration 1
t006144: 0 looses leadership.
t007507: 1 takes leadership. Leader was:-1
t011064: Stopping server 2 iteration 2
t011178: 1 looses leadership.
t014713: 0 takes leadership. Leader was:-1
t015817: Stopping server 0 iteration 3
t015922: 0 looses leadership.
t016572: 1 takes leadership. Leader was:-1
t020794: Stopping server 1 iteration 4
t020899: 1 looses leadership.
t022001: 0 takes leadership. Leader was:-1
t025944: Stopping server 2 iteration 5
t026060: 0 looses leadership.
t029048: 1 takes leadership. Leader was:-1
t031184: Stopping server 0 iteration 6
t031289: 1 looses leadership.
t031796: 0 takes leadership. Leader was:-1
t035777: Stopping server 1 iteration 7
t035892: 0 looses leadership.
t037935: 1 takes leadership. Leader was:-1
t040831: Stopping server 2 iteration 8
t040954: 1 looses leadership.
t044511: 0 takes leadership. Leader was:-1
t046298: Stopping server 0 iteration 9
t046402: 0 looses leadership.
t047248: 1 takes leadership. Leader was:-1
t051618: Stopping server 1 iteration 10
t051721: 1 looses leadership.
t052597: 0 takes leadership. Leader was:-1
t057122: Stopping server 2 iteration 11
t057239: 0 looses leadership.
t062421: Stopping server 0 iteration 12
t066103: 0 takes leadership. Leader was:-1
t067015: Stopping server 1 iteration 13
t067120: 0 looses leadership.
t070908: 0 takes leadership. Leader was:-1
t072351: Stopping server 2 iteration 14
t072465: 0 looses leadership.
t074737: 0 takes leadership. Leader was:-1
t077254: Stopping server 0 iteration 15
t077356: 0 looses leadership.
t077564: 0 takes leadership. Leader was:-1
t082598: Stopping server 1 iteration 16
t082709: 0 looses leadership.
t083460: 0 takes leadership. Leader was:-1
t088007: Stopping server 2 iteration 17
t088122: 0 looses leadership.
t089385: 0 takes leadership. Leader was:-1
t093239: Stopping server 0 iteration 18
t093342: 0 looses leadership.
t093599: 0 takes leadership. Leader was:-1
t097752: Stopping server 1 iteration 19
t097862: 0 looses leadership.
t099856: 0 takes leadership. Leader was:-1
t103008: Stopping server 2 iteration 20
t103122: 0 looses leadership.
t104398: 0 takes leadership. Leader was:-1
t108164: Stopping server 0 iteration 21
t108266: 0 looses leadership.
t108842: 0 takes leadership. Leader was:-1
t112872: Stopping server 1 iteration 22
t112975: 0 looses leadership.
t114180: 0 takes leadership. Leader was:-1
t118172: Stopping server 2 iteration 23
t118288: 0 looses leadership.
t120987: 0 takes leadership. Leader was:-1
t123105: Stopping server 0 iteration 24
t123206: 0 looses leadership.
t124172: 0 takes leadership. Leader was:-1
t128560: Stopping server 1 iteration 25
t128667: 0 looses leadership.
t131217: 0 takes leadership. Leader was:-1
t133301: Stopping server 2 iteration 26
t133415: 0 looses leadership.
t135576: 0 takes leadership. Leader was:-1
t138116: Stopping server 0 iteration 27
t138219: 0 looses leadership.
t138493: 0 takes leadership. Leader was:-1
t143080: Stopping server 1 iteration 28
t143193: 0 looses leadership.
                
      was (Author: randgalt):
    I ran the test for a while (without the InstanceSpec changes) and was not able to reproduce (see log below). If you post your InstanceSpec changes I'll try those. BTW, I recently started an email thread on user@ZK titled "rolling config change considered harmful". Please have a look:

http://qnalist.com/questions/3983279/rolling-config-change-considered-harmful

t002043: Starting test.
t002057: Stopping server 0 iteration 0
t002905: 0 takes leadership. Leader was:-1
t006030: Stopping server 1 iteration 1
t006144: 0 looses leadership.
t007507: 1 takes leadership. Leader was:-1
t011064: Stopping server 2 iteration 2
t011178: 1 looses leadership.
t014713: 0 takes leadership. Leader was:-1
t015817: Stopping server 0 iteration 3
t015922: 0 looses leadership.
t016572: 1 takes leadership. Leader was:-1
t020794: Stopping server 1 iteration 4
t020899: 1 looses leadership.
t022001: 0 takes leadership. Leader was:-1
t025944: Stopping server 2 iteration 5
t026060: 0 looses leadership.
t029048: 1 takes leadership. Leader was:-1
t031184: Stopping server 0 iteration 6
t031289: 1 looses leadership.
t031796: 0 takes leadership. Leader was:-1
t035777: Stopping server 1 iteration 7
t035892: 0 looses leadership.
t037935: 1 takes leadership. Leader was:-1
t040831: Stopping server 2 iteration 8
t040954: 1 looses leadership.
t044511: 0 takes leadership. Leader was:-1
t046298: Stopping server 0 iteration 9
t046402: 0 looses leadership.
t047248: 1 takes leadership. Leader was:-1
t051618: Stopping server 1 iteration 10
t051721: 1 looses leadership.
t052597: 0 takes leadership. Leader was:-1
t057122: Stopping server 2 iteration 11
t057239: 0 looses leadership.
t062421: Stopping server 0 iteration 12
t066103: 0 takes leadership. Leader was:-1
t067015: Stopping server 1 iteration 13
t067120: 0 looses leadership.
t070908: 0 takes leadership. Leader was:-1
t072351: Stopping server 2 iteration 14
t072465: 0 looses leadership.
t074737: 0 takes leadership. Leader was:-1
t077254: Stopping server 0 iteration 15
t077356: 0 looses leadership.
t077564: 0 takes leadership. Leader was:-1
t082598: Stopping server 1 iteration 16
t082709: 0 looses leadership.
t083460: 0 takes leadership. Leader was:-1
t088007: Stopping server 2 iteration 17
t088122: 0 looses leadership.
t089385: 0 takes leadership. Leader was:-1
t093239: Stopping server 0 iteration 18
t093342: 0 looses leadership.
t093599: 0 takes leadership. Leader was:-1
t097752: Stopping server 1 iteration 19
t097862: 0 looses leadership.
t099856: 0 takes leadership. Leader was:-1
t103008: Stopping server 2 iteration 20
t103122: 0 looses leadership.
t104398: 0 takes leadership. Leader was:-1
t108164: Stopping server 0 iteration 21
t108266: 0 looses leadership.
t108842: 0 takes leadership. Leader was:-1
t112872: Stopping server 1 iteration 22
t112975: 0 looses leadership.
t114180: 0 takes leadership. Leader was:-1
t118172: Stopping server 2 iteration 23
t118288: 0 looses leadership.
t120987: 0 takes leadership. Leader was:-1
t123105: Stopping server 0 iteration 24
t123206: 0 looses leadership.
t124172: 0 takes leadership. Leader was:-1
t128560: Stopping server 1 iteration 25
t128667: 0 looses leadership.
t131217: 0 takes leadership. Leader was:-1
t133301: Stopping server 2 iteration 26
t133415: 0 looses leadership.
t135576: 0 takes leadership. Leader was:-1
t138116: Stopping server 0 iteration 27
t138219: 0 looses leadership.
t138493: 0 takes leadership. Leader was:-1
t143080: Stopping server 1 iteration 28
t143193: 0 looses leadership.

                  
> Two leaders in the leader election recipe
> -----------------------------------------
>
>                 Key: CURATOR-47
>                 URL: https://issues.apache.org/jira/browse/CURATOR-47
>             Project: Apache Curator
>          Issue Type: Bug
>          Components: Recipes
>    Affects Versions: 2.1.0-incubating
>         Environment: Windows and Java 1.7
>            Reporter: Germán Blanco
>             Fix For: TBD
>
>         Attachments: TestLeaderElection.java
>
>
> When using the Leader Election recipe and doing a rolling restart of the zookeeper servers in the cluster, it seems that at some points there are two participants that see themselves as leaders.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira