You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@kafka.apache.org by "Matthias Rampke (JIRA)" <ji...@apache.org> on 2017/12/08 15:22:00 UTC

[jira] [Commented] (KAFKA-5200) If a replicated topic is deleted with one broker down, it can't be recreated

    [ https://issues.apache.org/jira/browse/KAFKA-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16283713#comment-16283713 ] 

Matthias Rampke commented on KAFKA-5200:
----------------------------------------

To expand on the workaround [~huxi_2b] proposed:

If you cannot resurrect the dead broker itself, you can make Kafka act as if you did

#  Start a new broker, but then shut it down quickly (before any newly created partitions are assigned to it).
# in meta.properties, change the broker ID to the one of the dead broker
# Start it
# watch its logs – it will pick up the pending deletions and go through, or you can reassign at this point
# stop it again

This may be problematic if you have a lot of partition creation going on, because you need to avoid getting any partitions assigned to this broker while it's running, but otherwise this works without downtime.

> If a replicated topic is deleted with one broker down, it can't be recreated
> ----------------------------------------------------------------------------
>
>                 Key: KAFKA-5200
>                 URL: https://issues.apache.org/jira/browse/KAFKA-5200
>             Project: Kafka
>          Issue Type: Improvement
>          Components: core
>            Reporter: Edoardo Comar
>
> In a cluster with 5 broker, replication factor=3, min in sync=2,
> one broker went down 
> A user's app remained of course unaware of that and deleted a topic that (unknowingly) had a replica on the dead broker.
> The topic went in 'pending delete' mode
> The user then tried to recreate the topic - which failed, so his app was left stuck - no working topic and no ability to create one.
> The reassignment tool fails to move the replica out of the dead broker - specifically because the broker with the partition replica to move is dead :-)
> Incidentally the confluent-rebalancer docs say
> http://docs.confluent.io/current/kafka/post-deployment.html#scaling-the-cluster
> > Supports moving partitions away from dead brokers
> It'd be nice to similarly improve the opensource reassignment tool



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)