You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Lucas Wang (Jira)" <ji...@apache.org> on 2022/11/29 23:17:00 UTC

[jira] [Created] (KAFKA-14424) Cancellation of an ongoing replica reassignment should have sanity checks

Lucas Wang created KAFKA-14424:
----------------------------------

             Summary: Cancellation of an ongoing replica reassignment should have sanity checks
                 Key: KAFKA-14424
                 URL: https://issues.apache.org/jira/browse/KAFKA-14424
             Project: Kafka
          Issue Type: Improvement
            Reporter: Lucas Wang


When reassigning replicas, Kafka runs a sanity check to ensure all of the target replicas are alive before allowing the reassignment request to proceed.
However, for an AlterPartitionReassignments request that cancels an ongoing reassignment, there is no such check.
The result is that if the original replicas are offline, the cancellation may result in partitions
without any leaders. This problem has been observed in our clusters.

 

There should be some sanity check to ensure the cancellation would also land the partitions in valid states, e.g. by ensuring all of the original replicas are all alive.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)