You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@geode.apache.org by "Alexander Murmann (Jira)" <ji...@apache.org> on 2021/01/07 18:59:00 UTC

[jira] [Commented] (GEODE-8739) Split brain when locators exhaust join attempts on non existant servers

    [ https://issues.apache.org/jira/browse/GEODE-8739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17260734#comment-17260734 ] 

Alexander Murmann commented on GEODE-8739:
------------------------------------------

{quote}The task is to see if there is an appropriate place to document the fact that if an operator shuts down a whole cluster and leaves storage for .dat files intact, that they should delete those .dat files before restarting the cluster.
{quote}
[~burcham] Has this happened? This seems like a rather serious problem if an operator isn't aware of it that might result in many user headaches.

> Split brain when locators exhaust join attempts on non existant servers
> -----------------------------------------------------------------------
>
>                 Key: GEODE-8739
>                 URL: https://issues.apache.org/jira/browse/GEODE-8739
>             Project: Geode
>          Issue Type: Bug
>          Components: membership
>            Reporter: Jason Huynh
>            Priority: Major
>         Attachments: exportedLogs_locator-0.zip, exportedLogs_locator-1.zip
>
>
> The hypothesis: "if there is a locator view .dat file with several non-existent servers then then locators will waste all of their join attempts on the servers instead of finding each other"
> Scenario is a test/user attempts to recreate a cluster with existing .dat and persistent files.  The locators are spun in parallel and from the analysis, it looks like they are able to communicate with each other, but then end up forming their own ds.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)