You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@geode.apache.org by Alberto Gomez <al...@est.tech> on 2020/06/06 14:06:24 UTC

Problem in rolling upgrade since 1.12

Hi,

I have observed that since version 1.12 rolling upgrades to future versions leave the first upgraded locator "as if" it was still on version 1.12.

This is the output from "list members" before starting the upgrade from version 1.12:

Name | Id
---- | -----------------------------------------------------------
vm2  | 192.168.0.37(vm2:29367:locator)<ec><v0>:41001 [Coordinator]
vm0  | 192.168.0.37(vm0:29260)<v1>:41002
vm1  | 192.168.0.37(vm1:29296)<v2>:41003


And this is the output from "list members" after upgrading the first locator from 1.12 to 1.13/1.14:

Name | Id
---- | --------------------------------------------------------------------------------
vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0) [Coordinator]
vm0  | 192.168.0.37(vm0:810)<v1>:41002(version:GEODE 1.12.0)
vm1  | 192.168.0.37(vm1:849)<v2>:41003(version:GEODE 1.12.0)


Finally this is the output in gfsh once the rolling upgrade has been completed (locators and servers upgraded):

Name | Id
---- | --------------------------------------------------------------------------------
vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0) [Coordinator]
vm0  | 192.168.0.37(vm0:2457)<v23>:41002
vm1  | 192.168.0.37(vm1:2576)<v25>:41003

I verified this by running manual tests and also by running the following upgrade test (had to stop it in the middle to connect via gfsh and get the gfsh outputs):
RollingUpgradeRollServersOnPartitionedRegion_dataserializable.testRollServersOnPartitionedRegion_dataserializable

After the rolling upgrade, the shutdown command fails with the following error:
Member 192.168.0.37(vm2:1453:locator)<ec><v8>:41001 could not be found.  Please verify the member name or ID and try again.

The only way I have found to come out of the situation is by restarting the locator.
Once restarted again, the output of gfsh shows that all members are upgraded to the new version, i.e. the locator does not show anymore that it is on version GEODE 1.12.0.

Anybody has any clue why this is happening?

Thanks in advance,

/Alberto G.

Odg: Problem in rolling upgrade since 1.12

Posted by Mario Ivanac <ma...@est.tech>.
Hi,

as I see in 1.12 release, there were bigger impacts in membership part of code with:

"Move the membership code into a separate gradle sub-project"

https://issues.apache.org/jira/browse/GEODE-6883

Maybe to check is this connected to reported bug.

BR,
Mario
________________________________
Šalje: Alberto Gomez <al...@est.tech>
Poslano: 8. lipnja 2020. 9:29
Prima: dev@geode.apache.org <de...@geode.apache.org>
Predmet: Re: Problem in rolling upgrade since 1.12

Hi,

I attach a diff for the modified test case in case you would like to use it to check the problem I mentioned.

BR,

Alberto
________________________________
From: Alberto Gomez <al...@est.tech>
Sent: Saturday, June 6, 2020 4:06 PM
To: dev@geode.apache.org <de...@geode.apache.org>
Subject: Problem in rolling upgrade since 1.12

Hi,

I have observed that since version 1.12 rolling upgrades to future versions leave the first upgraded locator "as if" it was still on version 1.12.

This is the output from "list members" before starting the upgrade from version 1.12:

Name | Id
---- | -----------------------------------------------------------
vm2  | 192.168.0.37(vm2:29367:locator)<ec><v0>:41001 [Coordinator]
vm0  | 192.168.0.37(vm0:29260)<v1>:41002
vm1  | 192.168.0.37(vm1:29296)<v2>:41003


And this is the output from "list members" after upgrading the first locator from 1.12 to 1.13/1.14:

Name | Id
---- | --------------------------------------------------------------------------------
vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0) [Coordinator]
vm0  | 192.168.0.37(vm0:810)<v1>:41002(version:GEODE 1.12.0)
vm1  | 192.168.0.37(vm1:849)<v2>:41003(version:GEODE 1.12.0)


Finally this is the output in gfsh once the rolling upgrade has been completed (locators and servers upgraded):

Name | Id
---- | --------------------------------------------------------------------------------
vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0) [Coordinator]
vm0  | 192.168.0.37(vm0:2457)<v23>:41002
vm1  | 192.168.0.37(vm1:2576)<v25>:41003

I verified this by running manual tests and also by running the following upgrade test (had to stop it in the middle to connect via gfsh and get the gfsh outputs):
RollingUpgradeRollServersOnPartitionedRegion_dataserializable.testRollServersOnPartitionedRegion_dataserializable

After the rolling upgrade, the shutdown command fails with the following error:
Member 192.168.0.37(vm2:1453:locator)<ec><v8>:41001 could not be found.  Please verify the member name or ID and try again.

The only way I have found to come out of the situation is by restarting the locator.
Once restarted again, the output of gfsh shows that all members are upgraded to the new version, i.e. the locator does not show anymore that it is on version GEODE 1.12.0.

Anybody has any clue why this is happening?

Thanks in advance,

/Alberto G.

Re: Problem in rolling upgrade since 1.12

Posted by Bill Burcham <bi...@gmail.com>.
I made us a ticket for the SerializationException Alberto Gomez reported
https://issues.apache.org/jira/browse/GEODE-8251

On Thu, Jun 11, 2020 at 6:47 AM Alberto Gomez <al...@est.tech>
wrote:

> Thanks for the info, Bill.
>
> I have found another issue in rolling upgrade since 1.12.
>
> I have observed that when there are custom jars previously deployed, the
> locator is not able be started in the new version and the following
> exception is thrown:
>
> Exception in thread "main" org.apache.geode.SerializationException: Could
> not create an instance of
> org.apache.geode.management.internal.configuration.domain.Configuration .
>
> I have pushed another commit in the draft I sent before containing the new
> test case.
>
> /Alberto G.
> ________________________________
> From: Bill Burcham <bi...@gmail.com>
> Sent: Thursday, June 11, 2020 1:53 AM
> To: dev@geode.apache.org <de...@geode.apache.org>
> Subject: Re: Problem in rolling upgrade since 1.12
>
> Ernie made us a ticket for this issue:
> https://issues.apache.org/jira/browse/GEODE-8240
>
> On Mon, Jun 8, 2020 at 12:59 PM Alberto Gomez <al...@est.tech>
> wrote:
>
> > Hi Ernie,
> >
> > I have seen this problem in the support/1.13 branch and also on develop.
> >
> > Interestingly, the patch I sent is applied seamlessly in my local repo
> set
> > to the develop branch.
> >
> > The patch modifies the
> > RollingUpgradeRollServersOnPartitionedRegion_dataserializable test case
> by
> > running "list members" on an upgraded system is
> > RollingUpgradeRollServersOnPartitionedRegion_dataserializable. I run it
> > manually with the following command:
> >
> > ./gradlew geode-core:upgradeTest
> > --tests=RollingUpgradeRollServersOnPartitionedRegion_dataserializable
> >
> > I see it failing when upgrading from 1.12.
> >
> > I created a draft PR where you can see also the changes in the test case
> > that manifest the problem.
> >
> > See: https://github.com/apache/geode/pull/5224
> >
> >
> > Please, let me know if you need any more information.
> >
> > BR,
> >
> > Alberto
> > ________________________________
> > From: Ernie Burghardt <bu...@vmware.com>
> > Sent: Monday, June 8, 2020 9:04 PM
> > To: dev@geode.apache.org <de...@geode.apache.org>
> > Subject: Re: Problem in rolling upgrade since 1.12
> >
> > Hi Alberto,
> >
> > I’m looking at this, came across a couple blockers…
> > Do you have branch that exhibits this problem? Draft PR maybe?
> > I tried to apply you patch to latest develop, but the patch doesn’t pass
> > git apply’s check….
> > Also these tests pass on develop, would you be able to check against the
> > latest and update the diff?
> > I’m very interested in reproducing the issue you have observed.
> >
> > Thanks,
> > Ernie
> >
> > From: Alberto Gomez <al...@est.tech>
> > Reply-To: "dev@geode.apache.org" <de...@geode.apache.org>
> > Date: Monday, June 8, 2020 at 12:31 AM
> > To: "dev@geode.apache.org" <de...@geode.apache.org>
> > Subject: Re: Problem in rolling upgrade since 1.12
> >
> > Hi,
> >
> > I attach a diff for the modified test case in case you would like to use
> > it to check the problem I mentioned.
> >
> > BR,
> >
> > Alberto
> > ________________________________
> > From: Alberto Gomez <al...@est.tech>
> > Sent: Saturday, June 6, 2020 4:06 PM
> > To: dev@geode.apache.org <de...@geode.apache.org>
> > Subject: Problem in rolling upgrade since 1.12
> >
> > Hi,
> >
> > I have observed that since version 1.12 rolling upgrades to future
> > versions leave the first upgraded locator "as if" it was still on version
> > 1.12.
> >
> > This is the output from "list members" before starting the upgrade from
> > version 1.12:
> >
> > Name | Id
> > ---- | -----------------------------------------------------------
> > vm2  | 192.168.0.37(vm2:29367:locator)<ec><v0>:41001 [Coordinator]
> > vm0  | 192.168.0.37(vm0:29260)<v1>:41002
> > vm1  | 192.168.0.37(vm1:29296)<v2>:41003
> >
> >
> > And this is the output from "list members" after upgrading the first
> > locator from 1.12 to 1.13/1.14:
> >
> > Name | Id
> > ---- |
> >
> --------------------------------------------------------------------------------
> > vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0)
> > [Coordinator]
> > vm0  | 192.168.0.37(vm0:810)<v1>:41002(version:GEODE 1.12.0)
> > vm1  | 192.168.0.37(vm1:849)<v2>:41003(version:GEODE 1.12.0)
> >
> >
> > Finally this is the output in gfsh once the rolling upgrade has been
> > completed (locators and servers upgraded):
> >
> > Name | Id
> > ---- |
> >
> --------------------------------------------------------------------------------
> > vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0)
> > [Coordinator]
> > vm0  | 192.168.0.37(vm0:2457)<v23>:41002
> > vm1  | 192.168.0.37(vm1:2576)<v25>:41003
> >
> > I verified this by running manual tests and also by running the following
> > upgrade test (had to stop it in the middle to connect via gfsh and get
> the
> > gfsh outputs):
> >
> >
> RollingUpgradeRollServersOnPartitionedRegion_dataserializable.testRollServersOnPartitionedRegion_dataserializable
> >
> > After the rolling upgrade, the shutdown command fails with the following
> > error:
> > Member 192.168.0.37(vm2:1453:locator)<ec><v8>:41001 could not be found.
> > Please verify the member name or ID and try again.
> >
> > The only way I have found to come out of the situation is by restarting
> > the locator.
> > Once restarted again, the output of gfsh shows that all members are
> > upgraded to the new version, i.e. the locator does not show anymore that
> it
> > is on version GEODE 1.12.0.
> >
> > Anybody has any clue why this is happening?
> >
> > Thanks in advance,
> >
> > /Alberto G.
> >
>

Re: Problem in rolling upgrade since 1.12

Posted by Alberto Gomez <al...@est.tech>.
Thanks for the info, Bill.

I have found another issue in rolling upgrade since 1.12.

I have observed that when there are custom jars previously deployed, the locator is not able be started in the new version and the following exception is thrown:

Exception in thread "main" org.apache.geode.SerializationException: Could not create an instance of org.apache.geode.management.internal.configuration.domain.Configuration .

I have pushed another commit in the draft I sent before containing the new test case.

/Alberto G.
________________________________
From: Bill Burcham <bi...@gmail.com>
Sent: Thursday, June 11, 2020 1:53 AM
To: dev@geode.apache.org <de...@geode.apache.org>
Subject: Re: Problem in rolling upgrade since 1.12

Ernie made us a ticket for this issue:
https://issues.apache.org/jira/browse/GEODE-8240

On Mon, Jun 8, 2020 at 12:59 PM Alberto Gomez <al...@est.tech>
wrote:

> Hi Ernie,
>
> I have seen this problem in the support/1.13 branch and also on develop.
>
> Interestingly, the patch I sent is applied seamlessly in my local repo set
> to the develop branch.
>
> The patch modifies the
> RollingUpgradeRollServersOnPartitionedRegion_dataserializable test case by
> running "list members" on an upgraded system is
> RollingUpgradeRollServersOnPartitionedRegion_dataserializable. I run it
> manually with the following command:
>
> ./gradlew geode-core:upgradeTest
> --tests=RollingUpgradeRollServersOnPartitionedRegion_dataserializable
>
> I see it failing when upgrading from 1.12.
>
> I created a draft PR where you can see also the changes in the test case
> that manifest the problem.
>
> See: https://github.com/apache/geode/pull/5224
>
>
> Please, let me know if you need any more information.
>
> BR,
>
> Alberto
> ________________________________
> From: Ernie Burghardt <bu...@vmware.com>
> Sent: Monday, June 8, 2020 9:04 PM
> To: dev@geode.apache.org <de...@geode.apache.org>
> Subject: Re: Problem in rolling upgrade since 1.12
>
> Hi Alberto,
>
> I’m looking at this, came across a couple blockers…
> Do you have branch that exhibits this problem? Draft PR maybe?
> I tried to apply you patch to latest develop, but the patch doesn’t pass
> git apply’s check….
> Also these tests pass on develop, would you be able to check against the
> latest and update the diff?
> I’m very interested in reproducing the issue you have observed.
>
> Thanks,
> Ernie
>
> From: Alberto Gomez <al...@est.tech>
> Reply-To: "dev@geode.apache.org" <de...@geode.apache.org>
> Date: Monday, June 8, 2020 at 12:31 AM
> To: "dev@geode.apache.org" <de...@geode.apache.org>
> Subject: Re: Problem in rolling upgrade since 1.12
>
> Hi,
>
> I attach a diff for the modified test case in case you would like to use
> it to check the problem I mentioned.
>
> BR,
>
> Alberto
> ________________________________
> From: Alberto Gomez <al...@est.tech>
> Sent: Saturday, June 6, 2020 4:06 PM
> To: dev@geode.apache.org <de...@geode.apache.org>
> Subject: Problem in rolling upgrade since 1.12
>
> Hi,
>
> I have observed that since version 1.12 rolling upgrades to future
> versions leave the first upgraded locator "as if" it was still on version
> 1.12.
>
> This is the output from "list members" before starting the upgrade from
> version 1.12:
>
> Name | Id
> ---- | -----------------------------------------------------------
> vm2  | 192.168.0.37(vm2:29367:locator)<ec><v0>:41001 [Coordinator]
> vm0  | 192.168.0.37(vm0:29260)<v1>:41002
> vm1  | 192.168.0.37(vm1:29296)<v2>:41003
>
>
> And this is the output from "list members" after upgrading the first
> locator from 1.12 to 1.13/1.14:
>
> Name | Id
> ---- |
> --------------------------------------------------------------------------------
> vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0)
> [Coordinator]
> vm0  | 192.168.0.37(vm0:810)<v1>:41002(version:GEODE 1.12.0)
> vm1  | 192.168.0.37(vm1:849)<v2>:41003(version:GEODE 1.12.0)
>
>
> Finally this is the output in gfsh once the rolling upgrade has been
> completed (locators and servers upgraded):
>
> Name | Id
> ---- |
> --------------------------------------------------------------------------------
> vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0)
> [Coordinator]
> vm0  | 192.168.0.37(vm0:2457)<v23>:41002
> vm1  | 192.168.0.37(vm1:2576)<v25>:41003
>
> I verified this by running manual tests and also by running the following
> upgrade test (had to stop it in the middle to connect via gfsh and get the
> gfsh outputs):
>
> RollingUpgradeRollServersOnPartitionedRegion_dataserializable.testRollServersOnPartitionedRegion_dataserializable
>
> After the rolling upgrade, the shutdown command fails with the following
> error:
> Member 192.168.0.37(vm2:1453:locator)<ec><v8>:41001 could not be found.
> Please verify the member name or ID and try again.
>
> The only way I have found to come out of the situation is by restarting
> the locator.
> Once restarted again, the output of gfsh shows that all members are
> upgraded to the new version, i.e. the locator does not show anymore that it
> is on version GEODE 1.12.0.
>
> Anybody has any clue why this is happening?
>
> Thanks in advance,
>
> /Alberto G.
>

Re: Problem in rolling upgrade since 1.12

Posted by Bill Burcham <bi...@gmail.com>.
Ernie made us a ticket for this issue:
https://issues.apache.org/jira/browse/GEODE-8240

On Mon, Jun 8, 2020 at 12:59 PM Alberto Gomez <al...@est.tech>
wrote:

> Hi Ernie,
>
> I have seen this problem in the support/1.13 branch and also on develop.
>
> Interestingly, the patch I sent is applied seamlessly in my local repo set
> to the develop branch.
>
> The patch modifies the
> RollingUpgradeRollServersOnPartitionedRegion_dataserializable test case by
> running "list members" on an upgraded system is
> RollingUpgradeRollServersOnPartitionedRegion_dataserializable. I run it
> manually with the following command:
>
> ./gradlew geode-core:upgradeTest
> --tests=RollingUpgradeRollServersOnPartitionedRegion_dataserializable
>
> I see it failing when upgrading from 1.12.
>
> I created a draft PR where you can see also the changes in the test case
> that manifest the problem.
>
> See: https://github.com/apache/geode/pull/5224
>
>
> Please, let me know if you need any more information.
>
> BR,
>
> Alberto
> ________________________________
> From: Ernie Burghardt <bu...@vmware.com>
> Sent: Monday, June 8, 2020 9:04 PM
> To: dev@geode.apache.org <de...@geode.apache.org>
> Subject: Re: Problem in rolling upgrade since 1.12
>
> Hi Alberto,
>
> I’m looking at this, came across a couple blockers…
> Do you have branch that exhibits this problem? Draft PR maybe?
> I tried to apply you patch to latest develop, but the patch doesn’t pass
> git apply’s check….
> Also these tests pass on develop, would you be able to check against the
> latest and update the diff?
> I’m very interested in reproducing the issue you have observed.
>
> Thanks,
> Ernie
>
> From: Alberto Gomez <al...@est.tech>
> Reply-To: "dev@geode.apache.org" <de...@geode.apache.org>
> Date: Monday, June 8, 2020 at 12:31 AM
> To: "dev@geode.apache.org" <de...@geode.apache.org>
> Subject: Re: Problem in rolling upgrade since 1.12
>
> Hi,
>
> I attach a diff for the modified test case in case you would like to use
> it to check the problem I mentioned.
>
> BR,
>
> Alberto
> ________________________________
> From: Alberto Gomez <al...@est.tech>
> Sent: Saturday, June 6, 2020 4:06 PM
> To: dev@geode.apache.org <de...@geode.apache.org>
> Subject: Problem in rolling upgrade since 1.12
>
> Hi,
>
> I have observed that since version 1.12 rolling upgrades to future
> versions leave the first upgraded locator "as if" it was still on version
> 1.12.
>
> This is the output from "list members" before starting the upgrade from
> version 1.12:
>
> Name | Id
> ---- | -----------------------------------------------------------
> vm2  | 192.168.0.37(vm2:29367:locator)<ec><v0>:41001 [Coordinator]
> vm0  | 192.168.0.37(vm0:29260)<v1>:41002
> vm1  | 192.168.0.37(vm1:29296)<v2>:41003
>
>
> And this is the output from "list members" after upgrading the first
> locator from 1.12 to 1.13/1.14:
>
> Name | Id
> ---- |
> --------------------------------------------------------------------------------
> vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0)
> [Coordinator]
> vm0  | 192.168.0.37(vm0:810)<v1>:41002(version:GEODE 1.12.0)
> vm1  | 192.168.0.37(vm1:849)<v2>:41003(version:GEODE 1.12.0)
>
>
> Finally this is the output in gfsh once the rolling upgrade has been
> completed (locators and servers upgraded):
>
> Name | Id
> ---- |
> --------------------------------------------------------------------------------
> vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0)
> [Coordinator]
> vm0  | 192.168.0.37(vm0:2457)<v23>:41002
> vm1  | 192.168.0.37(vm1:2576)<v25>:41003
>
> I verified this by running manual tests and also by running the following
> upgrade test (had to stop it in the middle to connect via gfsh and get the
> gfsh outputs):
>
> RollingUpgradeRollServersOnPartitionedRegion_dataserializable.testRollServersOnPartitionedRegion_dataserializable
>
> After the rolling upgrade, the shutdown command fails with the following
> error:
> Member 192.168.0.37(vm2:1453:locator)<ec><v8>:41001 could not be found.
> Please verify the member name or ID and try again.
>
> The only way I have found to come out of the situation is by restarting
> the locator.
> Once restarted again, the output of gfsh shows that all members are
> upgraded to the new version, i.e. the locator does not show anymore that it
> is on version GEODE 1.12.0.
>
> Anybody has any clue why this is happening?
>
> Thanks in advance,
>
> /Alberto G.
>

Re: Problem in rolling upgrade since 1.12

Posted by Alberto Gomez <al...@est.tech>.
Hi Ernie,

I have seen this problem in the support/1.13 branch and also on develop.

Interestingly, the patch I sent is applied seamlessly in my local repo set to the develop branch.

The patch modifies the RollingUpgradeRollServersOnPartitionedRegion_dataserializable test case by running "list members" on an upgraded system is RollingUpgradeRollServersOnPartitionedRegion_dataserializable. I run it manually with the following command:

./gradlew geode-core:upgradeTest --tests=RollingUpgradeRollServersOnPartitionedRegion_dataserializable

I see it failing when upgrading from 1.12.

I created a draft PR where you can see also the changes in the test case that manifest the problem.

See: https://github.com/apache/geode/pull/5224


Please, let me know if you need any more information.

BR,

Alberto
________________________________
From: Ernie Burghardt <bu...@vmware.com>
Sent: Monday, June 8, 2020 9:04 PM
To: dev@geode.apache.org <de...@geode.apache.org>
Subject: Re: Problem in rolling upgrade since 1.12

Hi Alberto,

I’m looking at this, came across a couple blockers…
Do you have branch that exhibits this problem? Draft PR maybe?
I tried to apply you patch to latest develop, but the patch doesn’t pass git apply’s check….
Also these tests pass on develop, would you be able to check against the latest and update the diff?
I’m very interested in reproducing the issue you have observed.

Thanks,
Ernie

From: Alberto Gomez <al...@est.tech>
Reply-To: "dev@geode.apache.org" <de...@geode.apache.org>
Date: Monday, June 8, 2020 at 12:31 AM
To: "dev@geode.apache.org" <de...@geode.apache.org>
Subject: Re: Problem in rolling upgrade since 1.12

Hi,

I attach a diff for the modified test case in case you would like to use it to check the problem I mentioned.

BR,

Alberto
________________________________
From: Alberto Gomez <al...@est.tech>
Sent: Saturday, June 6, 2020 4:06 PM
To: dev@geode.apache.org <de...@geode.apache.org>
Subject: Problem in rolling upgrade since 1.12

Hi,

I have observed that since version 1.12 rolling upgrades to future versions leave the first upgraded locator "as if" it was still on version 1.12.

This is the output from "list members" before starting the upgrade from version 1.12:

Name | Id
---- | -----------------------------------------------------------
vm2  | 192.168.0.37(vm2:29367:locator)<ec><v0>:41001 [Coordinator]
vm0  | 192.168.0.37(vm0:29260)<v1>:41002
vm1  | 192.168.0.37(vm1:29296)<v2>:41003


And this is the output from "list members" after upgrading the first locator from 1.12 to 1.13/1.14:

Name | Id
---- | --------------------------------------------------------------------------------
vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0) [Coordinator]
vm0  | 192.168.0.37(vm0:810)<v1>:41002(version:GEODE 1.12.0)
vm1  | 192.168.0.37(vm1:849)<v2>:41003(version:GEODE 1.12.0)


Finally this is the output in gfsh once the rolling upgrade has been completed (locators and servers upgraded):

Name | Id
---- | --------------------------------------------------------------------------------
vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0) [Coordinator]
vm0  | 192.168.0.37(vm0:2457)<v23>:41002
vm1  | 192.168.0.37(vm1:2576)<v25>:41003

I verified this by running manual tests and also by running the following upgrade test (had to stop it in the middle to connect via gfsh and get the gfsh outputs):
RollingUpgradeRollServersOnPartitionedRegion_dataserializable.testRollServersOnPartitionedRegion_dataserializable

After the rolling upgrade, the shutdown command fails with the following error:
Member 192.168.0.37(vm2:1453:locator)<ec><v8>:41001 could not be found.  Please verify the member name or ID and try again.

The only way I have found to come out of the situation is by restarting the locator.
Once restarted again, the output of gfsh shows that all members are upgraded to the new version, i.e. the locator does not show anymore that it is on version GEODE 1.12.0.

Anybody has any clue why this is happening?

Thanks in advance,

/Alberto G.

Re: Problem in rolling upgrade since 1.12

Posted by Ernie Burghardt <bu...@vmware.com>.
Hi Alberto,

I’m looking at this, came across a couple blockers…
Do you have branch that exhibits this problem? Draft PR maybe?
I tried to apply you patch to latest develop, but the patch doesn’t pass git apply’s check….
Also these tests pass on develop, would you be able to check against the latest and update the diff?
I’m very interested in reproducing the issue you have observed.

Thanks,
Ernie

From: Alberto Gomez <al...@est.tech>
Reply-To: "dev@geode.apache.org" <de...@geode.apache.org>
Date: Monday, June 8, 2020 at 12:31 AM
To: "dev@geode.apache.org" <de...@geode.apache.org>
Subject: Re: Problem in rolling upgrade since 1.12

Hi,

I attach a diff for the modified test case in case you would like to use it to check the problem I mentioned.

BR,

Alberto
________________________________
From: Alberto Gomez <al...@est.tech>
Sent: Saturday, June 6, 2020 4:06 PM
To: dev@geode.apache.org <de...@geode.apache.org>
Subject: Problem in rolling upgrade since 1.12

Hi,

I have observed that since version 1.12 rolling upgrades to future versions leave the first upgraded locator "as if" it was still on version 1.12.

This is the output from "list members" before starting the upgrade from version 1.12:

Name | Id
---- | -----------------------------------------------------------
vm2  | 192.168.0.37(vm2:29367:locator)<ec><v0>:41001 [Coordinator]
vm0  | 192.168.0.37(vm0:29260)<v1>:41002
vm1  | 192.168.0.37(vm1:29296)<v2>:41003


And this is the output from "list members" after upgrading the first locator from 1.12 to 1.13/1.14:

Name | Id
---- | --------------------------------------------------------------------------------
vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0) [Coordinator]
vm0  | 192.168.0.37(vm0:810)<v1>:41002(version:GEODE 1.12.0)
vm1  | 192.168.0.37(vm1:849)<v2>:41003(version:GEODE 1.12.0)


Finally this is the output in gfsh once the rolling upgrade has been completed (locators and servers upgraded):

Name | Id
---- | --------------------------------------------------------------------------------
vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0) [Coordinator]
vm0  | 192.168.0.37(vm0:2457)<v23>:41002
vm1  | 192.168.0.37(vm1:2576)<v25>:41003

I verified this by running manual tests and also by running the following upgrade test (had to stop it in the middle to connect via gfsh and get the gfsh outputs):
RollingUpgradeRollServersOnPartitionedRegion_dataserializable.testRollServersOnPartitionedRegion_dataserializable

After the rolling upgrade, the shutdown command fails with the following error:
Member 192.168.0.37(vm2:1453:locator)<ec><v8>:41001 could not be found.  Please verify the member name or ID and try again.

The only way I have found to come out of the situation is by restarting the locator.
Once restarted again, the output of gfsh shows that all members are upgraded to the new version, i.e. the locator does not show anymore that it is on version GEODE 1.12.0.

Anybody has any clue why this is happening?

Thanks in advance,

/Alberto G.

Re: Problem in rolling upgrade since 1.12

Posted by Alberto Gomez <al...@est.tech>.
Hi,

I attach a diff for the modified test case in case you would like to use it to check the problem I mentioned.

BR,

Alberto
________________________________
From: Alberto Gomez <al...@est.tech>
Sent: Saturday, June 6, 2020 4:06 PM
To: dev@geode.apache.org <de...@geode.apache.org>
Subject: Problem in rolling upgrade since 1.12

Hi,

I have observed that since version 1.12 rolling upgrades to future versions leave the first upgraded locator "as if" it was still on version 1.12.

This is the output from "list members" before starting the upgrade from version 1.12:

Name | Id
---- | -----------------------------------------------------------
vm2  | 192.168.0.37(vm2:29367:locator)<ec><v0>:41001 [Coordinator]
vm0  | 192.168.0.37(vm0:29260)<v1>:41002
vm1  | 192.168.0.37(vm1:29296)<v2>:41003


And this is the output from "list members" after upgrading the first locator from 1.12 to 1.13/1.14:

Name | Id
---- | --------------------------------------------------------------------------------
vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0) [Coordinator]
vm0  | 192.168.0.37(vm0:810)<v1>:41002(version:GEODE 1.12.0)
vm1  | 192.168.0.37(vm1:849)<v2>:41003(version:GEODE 1.12.0)


Finally this is the output in gfsh once the rolling upgrade has been completed (locators and servers upgraded):

Name | Id
---- | --------------------------------------------------------------------------------
vm2  | 192.168.0.37(vm2:1453:locator)<ec><v8>:41001(version:GEODE 1.12.0) [Coordinator]
vm0  | 192.168.0.37(vm0:2457)<v23>:41002
vm1  | 192.168.0.37(vm1:2576)<v25>:41003

I verified this by running manual tests and also by running the following upgrade test (had to stop it in the middle to connect via gfsh and get the gfsh outputs):
RollingUpgradeRollServersOnPartitionedRegion_dataserializable.testRollServersOnPartitionedRegion_dataserializable

After the rolling upgrade, the shutdown command fails with the following error:
Member 192.168.0.37(vm2:1453:locator)<ec><v8>:41001 could not be found.  Please verify the member name or ID and try again.

The only way I have found to come out of the situation is by restarting the locator.
Once restarted again, the output of gfsh shows that all members are upgraded to the new version, i.e. the locator does not show anymore that it is on version GEODE 1.12.0.

Anybody has any clue why this is happening?

Thanks in advance,

/Alberto G.