Posted to jira@kafka.apache.org by "Jason Gustafson (Jira)" <ji...@apache.org> on 2020/05/28 20:27:00 UTC

[jira] [Updated] (KAFKA-10061) Flaky Test `ReassignPartitionsIntegrationTest.testCancellation`

     [ https://issues.apache.org/jira/browse/KAFKA-10061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jason Gustafson updated KAFKA-10061:
------------------------------------
    Description: 
We have seen this test fail a few times with the following timeout:
{code}
org.scalatest.exceptions.TestFailedException: Timed out waiting for verifyAssignment result VerifyAssignmentResult(Map(foo-0 -> PartitionReassignmentState(List(0, 1, 3, 2),List(0, 1, 3),false), baz-1 -> PartitionReassignmentState(List(0, 2, 3, 1),List(0, 2, 3),false)),true,Map(),false).  The latest result was VerifyAssignmentResult(Map(foo-0 -> PartitionReassignmentState(ArrayBuffer(0, 1, 3),List(0, 1, 3),true), baz-1 -> PartitionReassignmentState(ArrayBuffer(0, 2, 3),List(0, 2, 3),true)),false,HashMap(),false)
{code}

It looks like the reassignment is completing earlier than the test expects. In a failing run, the log shows foo-0 already reported as complete right after the reassignments are started:

{code}
Successfully started partition reassignments for baz-1,foo-0
==> verifyAssignment(adminClient, jsonString={"version":1,"partitions":[{"topic":"foo","partition":0,"replicas":[0,1,3],"log_dirs":["any","any","any"]},{"topic":"baz","partition":1,"replicas":[0,2,3],"log_dirs":["any","any","any"]}]})
Status of partition reassignment:
Reassignment of partition baz-1 is still in progress.
Reassignment of partition foo-0 is complete.
{code}

A successful run looks like this:
{code}
Successfully started partition reassignments for baz-1,foo-0
==> verifyAssignment(adminClient, jsonString={"version":1,"partitions":[{"topic":"foo","partition":0,"replicas":[0,1,3],"log_dirs":["any","any","any"]},{"topic":"baz","partition":1,"replicas":[0,2,3],"log_dirs":["any","any","any"]}]})
Status of partition reassignment:
Reassignment of partition baz-1 is still in progress.
Reassignment of partition foo-0 is still in progress.
{code}
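
The suspected race can be illustrated with a small simulation. This is only a sketch, assuming the test polls until the observed state equals an expected in-progress state, as the error message suggests. PartitionState, FakeReassignment and wait_for are hypothetical names made up for the illustration; they are not Kafka classes or test helpers.

```python
from dataclasses import dataclass


# Hypothetical stand-in for the per-partition state printed in the error
# message; this is not the Kafka class.
@dataclass(frozen=True)
class PartitionState:
    current_replicas: tuple
    target_replicas: tuple
    done: bool


class FakeReassignment:
    """Simulated reassignment that completes after a fixed number of polls."""

    def __init__(self, polls_until_done):
        self.polls_until_done = polls_until_done
        self.polls = 0

    def verify(self):
        # Once the poll budget is used up, the extra replica (2) is gone.
        done = self.polls >= self.polls_until_done
        self.polls += 1
        current = (0, 1, 3) if done else (0, 1, 3, 2)
        return PartitionState(current, (0, 1, 3), done)


def wait_for(verify, expected, max_polls=5):
    """Poll until the expected state is observed; return (matched, latest)."""
    latest = None
    for _ in range(max_polls):
        latest = verify()
        if latest == expected:
            return True, latest
    return False, latest


# The in-progress state the simulated test waits for (foo-0 in the report).
in_progress = PartitionState((0, 1, 3, 2), (0, 1, 3), False)

# Reassignment still running at the first poll: the expected state is seen.
seen_running, _ = wait_for(FakeReassignment(polls_until_done=2).verify, in_progress)

# Reassignment already finished at the first poll: the in-progress state can
# never be observed again, so the wait runs out of polls.
seen_finished, latest = wait_for(FakeReassignment(polls_until_done=0).verify, in_progress)
```

In the second case the waiter gives up with a latest state that is already done, which has the same shape as the "Timed out waiting for verifyAssignment result" failure above.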

  was:
We have seen this a few times:
```
org.scalatest.exceptions.TestFailedException: Timed out waiting for verifyAssignment result VerifyAssignmentResult(Map(foo-0 -> PartitionReassignmentState(List(0, 1, 3, 2),List(0, 1, 3),false), baz-1 -> PartitionReassignmentState(List(0, 2, 3, 1),List(0, 2, 3),false)),true,Map(),false).  The latest result was VerifyAssignmentResult(Map(foo-0 -> PartitionReassignmentState(ArrayBuffer(0, 1, 3),List(0, 1, 3),true), baz-1 -> PartitionReassignmentState(ArrayBuffer(0, 2, 3),List(0, 2, 3),true)),false,HashMap(),false)
```
It looks like the reassignment is completing earlier than the test expects. See the following from the log:

```
Successfully started partition reassignments for baz-1,foo-0
==> verifyAssignment(adminClient, jsonString={"version":1,"partitions":[{"topic":"foo","partition":0,"replicas":[0,1,3],"log_dirs":["any","any","any"]},{"topic":"baz","partition":1,"replicas":[0,2,3],"log_dirs":["any","any","any"]}]})
Status of partition reassignment:
Reassignment of partition baz-1 is still in progress.
Reassignment of partition foo-0 is complete.
```

A successful run looks like this:
```
Successfully started partition reassignments for baz-1,foo-0
==> verifyAssignment(adminClient, jsonString={"version":1,"partitions":[{"topic":"foo","partition":0,"replicas":[0,1,3],"log_dirs":["any","any","any"]},{"topic":"baz","partition":1,"replicas":[0,2,3],"log_dirs":["any","any","any"]}]})
Status of partition reassignment:
Reassignment of partition baz-1 is still in progress.
Reassignment of partition foo-0 is still in progress.
```


> Flaky Test `ReassignPartitionsIntegrationTest.testCancellation`
> ---------------------------------------------------------------
>
>                 Key: KAFKA-10061
>                 URL: https://issues.apache.org/jira/browse/KAFKA-10061
>             Project: Kafka
>          Issue Type: Bug
>            Reporter: Jason Gustafson
>            Assignee: Jason Gustafson
>            Priority: Major


