Posted to issues@geode.apache.org by "Owen Nichols (Jira)" <ji...@apache.org> on 2022/06/22 20:47:04 UTC

[jira] [Closed] (GEODE-9910) Failure to auto-reconnect upon network partition

     [ https://issues.apache.org/jira/browse/GEODE-9910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Owen Nichols closed GEODE-9910.
-------------------------------

> Failure to auto-reconnect upon network partition
> ------------------------------------------------
>
>                 Key: GEODE-9910
>                 URL: https://issues.apache.org/jira/browse/GEODE-9910
>             Project: Geode
>          Issue Type: Bug
>    Affects Versions: 1.14.0
>            Reporter: Surya Mudundi
>            Assignee: Barrett Oglesby
>            Priority: Major
>              Labels: GeodeOperationAPI, blocks-1.15.0, needsTriage, pull-request-available
>             Fix For: 1.12.10, 1.13.9, 1.14.5, 1.15.0
>
>         Attachments: geode-logs.zip
>
>
> A two-node cluster with embedded locators failed to auto-reconnect after node-1 experienced a network outage lasting a couple of minutes: when node-1 recovered from the outage, node-2 failed to auto-reconnect.
> node-2 tried to reconnect to node-1, logging:
> [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread] [] Attempting to reconnect to the distributed system.  This is attempt #1.
> [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread] [] Attempting to reconnect to the distributed system.  This is attempt #2.
> [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread] [] Attempting to reconnect to the distributed system.  This is attempt #3.
> After 3 attempts, node-2 finally reported the error below:
> INFO  [org.apache.geode.logging.internal.LoggingProviderLoader]-[ReconnectThread] [] Using org.apache.geode.logging.internal.SimpleLoggingProvider for service org.apache.geode.logging.internal.spi.LoggingProvider
> INFO  [org.apache.geode.internal.InternalDataSerializer]-[ReconnectThread] [] initializing InternalDataSerializer with 0 services
> INFO  [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread] [] performing a quorum check to see if location services can be started early
> INFO  [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread] [] Quorum check passed - allowing location services to start early
> WARN  [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread] [] Exception occurred while trying to connect the system during reconnect
> java.lang.IllegalStateException: A locator can not be created because one already exists in this JVM.
>         at org.apache.geode.distributed.internal.InternalLocator.createLocator(InternalLocator.java:298) ~[geode-core-1.14.0.jar:?]
>         at org.apache.geode.distributed.internal.InternalLocator.createLocator(InternalLocator.java:273) ~[geode-core-1.14.0.jar:?]
>         at org.apache.geode.distributed.internal.InternalDistributedSystem.startInitLocator(InternalDistributedSystem.java:916) ~[geode-core-1.14.0.jar:?]
>         at org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:768) ~[geode-core-1.14.0.jar:?]
>         at org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135) ~[geode-core-1.14.0.jar:?]
>         at org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3034) ~[geode-core-1.14.0.jar:?]
>         at org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290) ~[geode-core-1.14.0.jar:?]
>         at org.apache.geode.distributed.internal.InternalDistributedSystem.reconnect(InternalDistributedSystem.java:2605) ~[geode-core-1.14.0.jar:?]
>         at org.apache.geode.distributed.internal.InternalDistributedSystem.tryReconnect(InternalDistributedSystem.java:2424) ~[geode-core-1.14.0.jar:?]
>         at org.apache.geode.distributed.internal.InternalDistributedSystem.disconnect(InternalDistributedSystem.java:1275) ~[geode-core-1.14.0.jar:?]
>         at org.apache.geode.distributed.internal.ClusterDistributionManager$DMListener.membershipFailure(ClusterDistributionManager.java:2326) ~[geode-core-1.14.0.jar:?]
>         at org.apache.geode.distributed.internal.membership.gms.GMSMembership.uncleanShutdown(GMSMembership.java:1187) ~[geode-membership-1.14.0.jar:?]
>         at org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.lambda$forceDisconnect$0(GMSMembership.java:1811) ~[geode-membership-1.14.0.jar:?]
>         at java.lang.Thread.run(Thread.java:829) [?:?]
>  



--
This message was sent by Atlassian Jira
(v8.20.7#820007)