Posted to commits@cassandra.apache.org by "Peter Schuller (Updated) (JIRA)" <ji...@apache.org> on 2012/03/09 22:08:57 UTC
[jira] [Updated] (CASSANDRA-4035) post-effective ownership nodetool ring returns invalid information in some circumstances
[ https://issues.apache.org/jira/browse/CASSANDRA-4035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Peter Schuller updated CASSANDRA-4035:
--------------------------------------
Description:
CASSANDRA-3412 broke something. On a test cluster, I observed that the ring was unbalanced, which was unexpected because the cluster was supposed to be balanced. Later, it was balanced again. We then realized that a node was being replaced at the time the ring showed as unbalanced. The diff shows:
{code}
-10.34.115.115 rack ael Up Normal 26.32 KB 9.09% 36090554067372261276418518970036022421
-10.35.108.128 rack aoa Up Normal 24.42 KB 9.09% 41246347505568298601621164537184025624
-10.34.244.104 rack ajk Up Normal 27.11 KB 9.09% 46402140943764335926823810104332028827
-10.35.86.129 rack ane Up Normal 31.67 KB 9.09% 51557934381960373252026455671480032030
+10.35.108.128 rack aoa Up Normal 24.42 KB 12.12% 41246347505568298601621164537184025624
+10.34.244.104 rack ajk Up Normal 27.11 KB 12.12% 46402140943764335926823810104332028827
+10.35.86.129 rack ane Up Normal 31.67 KB 12.12% 51557934381960373252026455671480032030
{code}
The node that caused this was being replaced (with replace token, not a regular bootstrap) at or around the time the ring showed as unbalanced. The node was never removed, and if a regular bootstrap had been done by mistake, it should have shown up as joining.
Hypothesis without looking at code: Somehow nodes in HIBERNATE state are incorrectly considered?
(Marked fix for 1.1.1 because that's the fix-for of effective-ownership.)
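The arithmetic behind the two percentages can be sketched as follows. This is only an illustration under assumptions the report does not state: RandomPartitioner's 2^127 token space, nodes placed at whole multiples of the observed token spacing, a replication factor of 3, and replicas placed on the next nodes in ring order (as SimpleStrategy does). The helper names are hypothetical. Under those assumptions, dropping one node from the ring view reproduces 12.12% on exactly the three nodes that follow it.

```python
from fractions import Fraction

RING_SIZE = 2 ** 127  # RandomPartitioner token space (assumption)
RF = 3                # assumed replication factor; not stated in the report

# Tokens copied from the ring output above.
tokens = [
    36090554067372261276418518970036022421,  # 10.34.115.115
    41246347505568298601621164537184025624,  # 10.35.108.128
    46402140943764335926823810104332028827,  # 10.34.244.104
    51557934381960373252026455671480032030,  # 10.35.86.129
]

# Evenly spaced tokens imply the number of nodes in the ring.
spacing = tokens[1] - tokens[0]
nodes_in_ring = RING_SIZE // spacing


def effective_ownership(primary, rf):
    """Each node stores its own primary range plus those of its rf-1 predecessors."""
    n = len(primary)
    return [sum(primary[(i - k) % n] for k in range(rf)) for i in range(n)]


def pct(x):
    return f"{float(x):.2%}"


# Balanced ring: every node has an equal primary range (treated as exactly 1/N).
share = Fraction(1, nodes_in_ring)
healthy = effective_ownership([share] * nodes_in_ring, RF)

# Ring view with 10.34.115.115 missing: its primary range falls to its successor.
missing = tokens[0] // spacing  # ring position, assuming tokens at multiples of spacing
primary = [share] * nodes_in_ring
del primary[missing]
primary[missing % len(primary)] += share
skewed = effective_ownership(primary, RF)

print(nodes_in_ring)                                   # 33
print(pct(healthy[missing]))                           # 9.09%
print([pct(x) for x in skewed[missing:missing + 4]])   # three nodes at 12.12%, then 9.09%
```

If these assumptions hold, the 12.12% figures are what ownership would look like if the node being replaced were left out of the calculation entirely, which fits the hypothesis below about HIBERNATE state without confirming it.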
was:
CASSANDRA-3412 broke something. On a test cluster, I observed that the ring was unbalanced, which was unexpected because the cluster was supposed to be balanced. Later, it was balanced again. We then realized that a node was being replaced at the time the ring showed as unbalanced. The diff shows:
{code}
-10.34.115.115 smf1 ael Up Normal 26.32 KB 9.09% 36090554067372261276418518970036022421
-10.35.108.128 smf1 aoa Up Normal 24.42 KB 9.09% 41246347505568298601621164537184025624
-10.34.244.104 smf1 ajk Up Normal 27.11 KB 9.09% 46402140943764335926823810104332028827
-10.35.86.129 smf1 ane Up Normal 31.67 KB 9.09% 51557934381960373252026455671480032030
+10.35.108.128 smf1 aoa Up Normal 24.42 KB 12.12% 41246347505568298601621164537184025624
+10.34.244.104 smf1 ajk Up Normal 27.11 KB 12.12% 46402140943764335926823810104332028827
+10.35.86.129 smf1 ane Up Normal 31.67 KB 12.12% 51557934381960373252026455671480032030
{code}
The node that caused this was being replaced (with replace token, not a regular bootstrap) at or around the time the ring showed as unbalanced. The node was never removed, and if a regular bootstrap had been done by mistake, it should have shown up as joining.
Hypothesis without looking at code: Somehow nodes in HIBERNATE state are incorrectly considered?
(Marked fix for 1.1.1 because that's the fix-for of effective-ownership.)
> post-effective ownership nodetool ring returns invalid information in some circumstances
> ----------------------------------------------------------------------------------------
>
> Key: CASSANDRA-4035
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4035
> Project: Cassandra
> Issue Type: Bug
> Components: Core
> Reporter: Peter Schuller
> Fix For: 1.1.1
>