You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@geode.apache.org by "ASF subversion and git services (JIRA)" <ji...@apache.org> on 2017/06/06 22:49:18 UTC

[jira] [Commented] (GEODE-3024) race condition between server and restarted locator preparing membership views

    [ https://issues.apache.org/jira/browse/GEODE-3024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16039817#comment-16039817 ] 

ASF subversion and git services commented on GEODE-3024:
--------------------------------------------------------

Commit 31b72ba48b2dda95954b30c14ae62a8730065b3f in geode's branch refs/heads/develop from [~bschuchardt]
[ https://git-wip-us.apache.org/repos/asf?p=geode.git;h=31b72ba ]

GEODE-3024 race condition between server locator preparing membership views

If a locator is preparing a conflicting membership view we now abandon
preparation of a view in a cache server and pause before retrying.
This gives the locator time to gather information from the cache server's
view (which it receives in acks while preparing its own view),
incorporate them into a new view and send it out.  When the cache
server installs the new view from the locator it will shut down its
ViewCreator thread.


> race condition between server and restarted locator preparing membership views
> ------------------------------------------------------------------------------
>
>                 Key: GEODE-3024
>                 URL: https://issues.apache.org/jira/browse/GEODE-3024
>             Project: Geode
>          Issue Type: Bug
>          Components: membership
>            Reporter: Bruce Schuchardt
>
> When a locator is restarted & recovers from disk it will try to take over the role of membership coordinator for the cluster if it finds the current coordinator is a cache server.  If the cache server is in the process of sending out a new view it may get into a race with the locator in sending out view preparation messages.
> The locator will send out a view-prep message and the server will also send one.  Responses to the view-prep message will include the conflicting view and each of the two processes will create a new view and send it out.  This repeats ad-infinitum.
> This problem was observed in a system that was shutting down at the same time a locator was being restarted.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)