You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Adam Holmberg (Jira)" <ji...@apache.org> on 2021/03/26 00:36:00 UTC

[jira] [Comment Edited] (CASSANDRA-13517) dtest failure in paxos_tests.TestPaxos.contention_test_many_threads

    [ https://issues.apache.org/jira/browse/CASSANDRA-13517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17309065#comment-17309065 ] 

Adam Holmberg edited comment on CASSANDRA-13517 at 3/26/21, 12:35 AM:
----------------------------------------------------------------------

The test fails this way intermittently, but consistently on a resource-constrained VM. Failures are characterized by driver heartbeat timeouts which [exits the worker|https://github.com/apache/cassandra-dtest/blob/49f46fce94c8f25f32e9b778ded8b14c30ad851e/paxos_test.py#L145-L149] and does not retry. I think the server and cluster are just being overwhelmed. This never fails on a well-provisioned machine.

The proposed change creates a client connection with ample timeouts and heartbeats disabled. I'm also reducing the concurrency from one arbitrary number to another slightly smaller arbitrary number to make it a bit more appropriate in the envelope of a single-host three-node test cluster.

[test patch|https://github.com/aholmberg/cassandra-dtest/pull/5]
 [ci|https://app.circleci.com/pipelines/github/aholmberg/cassandra?branch=CASSANDRA-13517] (started, not reviewed)


was (Author: aholmber):
The test fails this way intermittently, but consistently on a resource-constrained VM. Failures are characterized by driver heartbeat timeouts which [exits the worker|https://github.com/apache/cassandra-dtest/blob/49f46fce94c8f25f32e9b778ded8b14c30ad851e/paxos_test.py#L145-L149] and does not retry.

The proposed change creates a client connection with ample timeouts and heartbeats disabled. I'm also reducing the concurrency from one arbitrary number to another slightly smaller arbitrary number to make it a bit more appropriate in the envelope of a single-host three-node test cluster.

[test patch|https://github.com/aholmberg/cassandra-dtest/pull/5]
 [ci|https://app.circleci.com/pipelines/github/aholmberg/cassandra?branch=CASSANDRA-13517] (started, not reviewed)

> dtest failure in paxos_tests.TestPaxos.contention_test_many_threads
> -------------------------------------------------------------------
>
>                 Key: CASSANDRA-13517
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-13517
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Test/dtest/python
>            Reporter: Ariel Weisberg
>            Assignee: Adam Holmberg
>            Priority: Normal
>              Labels: dtest, test-failure, test-failure-fresh
>             Fix For: 4.0-rc
>
>         Attachments: test_failure.txt
>
>
> Error Message
> AssertionError: value=278, errors=22, retries=2888 assert (278 == (300 * 1))
> {noformat}
> > assert (value == threads * iterations) and (errors == 0), "value={}, errors={}, retries={}".format(value, errors, retries) 
> E AssertionError: value=278, errors=22, retries=2888 E assert (278 == (300 * 1)) 
> paxos_test.py:195: AssertionError
>   {noformat}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@cassandra.apache.org
For additional commands, e-mail: commits-help@cassandra.apache.org