You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Michael Shuler (JIRA)" <ji...@apache.org> on 2014/07/24 20:36:38 UTC

[jira] [Commented] (CASSANDRA-7568) Replacing a dead node using replace_address fails

    [ https://issues.apache.org/jira/browse/CASSANDRA-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14073495#comment-14073495 ] 

Michael Shuler commented on CASSANDRA-7568:
-------------------------------------------

It looks like this particular error was corrected, but in reproducing this, I found a hardcoded assertion for 256 tokens, if vnodes are enabled, so this should be corrected to use $NUM_TOKENS:

{noformat}
$ export MAX_HEAP_SIZE="1G"; export HEAP_NEWSIZE="256M"; NUM_TOKENS=32 PRINT_DEBUG=true KEEP_TEST_DIR=true nosetests --nocapture --nologcapture --verbosity=3 replace_address_test.py 
nose.config: INFO: Ignoring files matching ['^\\.', '^_', '^setup\\.py$']
replace_active_node_test (replace_address_test.TestReplaceAddress) ... cluster ccm directory: /tmp/dtest-eAEKKO
Starting cluster with 3 nodes.
Inserting Data...
Created keyspaces. Sleeping 1s for propagation.
total,interval_op_rate,interval_key_rate,latency,95th,99.9th,elapsed_time
10000,1000,1000,17.3,128.5,518.0,8


Total operation time      : 00:00:08
END
Starting node 4 to replace active node 3
ok
replace_nonexistent_node_test (replace_address_test.TestReplaceAddress) ... cluster ccm directory: /tmp/dtest-2siZVd
Starting cluster with 3 nodes.
Inserting Data...
Created keyspaces. Sleeping 1s for propagation.
total,interval_op_rate,interval_key_rate,latency,95th,99.9th,elapsed_time
10000,1000,1000,20.8,145.0,360.4,9


Total operation time      : 00:00:09
END
Start node 4 and replace an address with no node
ok
replace_stopped_node_test (replace_address_test.TestReplaceAddress) ... cluster ccm directory: /tmp/dtest-M5WycR
Starting cluster with 3 nodes.
Inserting Data...
Created keyspaces. Sleeping 1s for propagation.
total,interval_op_rate,interval_key_rate,latency,95th,99.9th,elapsed_time
10000,1000,1000,18.5,120.5,319.7,8


Total operation time      : 00:00:08
END
Stopping node 3.
Testing node stoppage (query should fail).
Starting node 4 to replace node 3
Verifying querying works again.
Verifying tokens migrated sucessfully
(' WARN [main] 2014-07-24 13:30:00,043 TokenMetadata.java (line 201) Token -3570582696082918770 changing ownership from /127.0.0.3 to /127.0.0.4\n', <_sre.SRE_Match object at 0x7f6e4c23a8b8>)
FAIL

======================================================================
FAIL: replace_stopped_node_test (replace_address_test.TestReplaceAddress)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/home/mshuler/git/cassandra-dtest/replace_address_test.py", line 78, in replace_stopped_node_test
    self.assertEqual(len(movedTokensList), 256)
AssertionError: 32 != 256

----------------------------------------------------------------------
Ran 3 tests in 252.630s

FAILED (failures=1)
{noformat}

> Replacing a dead node using replace_address fails
> -------------------------------------------------
>
>                 Key: CASSANDRA-7568
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-7568
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Tests
>            Reporter: Ala' Alkhaldi
>            Priority: Minor
>
> Failed assertion
> {code}
> ERROR [main] 2014-07-17 10:24:21,171 CassandraDaemon.java:474 - Exception encountered during startup
> java.lang.AssertionError: Expected 1 endpoint but found 0
>         at org.apache.cassandra.dht.RangeStreamer.getAllRangesWithStrictSourcesFor(RangeStreamer.java:222) ~[main/:na]
>         at org.apache.cassandra.dht.RangeStreamer.addRanges(RangeStreamer.java:131) ~[main/:na]
>         at org.apache.cassandra.dht.BootStrapper.bootstrap(BootStrapper.java:72) ~[main/:na]
>         at org.apache.cassandra.service.StorageService.bootstrap(StorageService.java:1049) ~[main/:na]
>         at org.apache.cassandra.service.StorageService.joinTokenRing(StorageService.java:811) ~[main/:na]
>         at org.apache.cassandra.service.StorageService.initServer(StorageService.java:626) ~[main/:na]
>         at org.apache.cassandra.service.StorageService.initServer(StorageService.java:511) ~[main/:na]
>         at org.apache.cassandra.service.CassandraDaemon.setup(CassandraDaemon.java:338) [main/:na]
>         at org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:457) [main/:na]
>         at org.apache.cassandra.service.CassandraDaemon.main(CassandraDaemon.java:546) [main/:na]
> {code}
> To replicate the bug run the "replace_address_test.replace_stopped_node_test" dtest



--
This message was sent by Atlassian JIRA
(v6.2#6252)