Posted to users@kafka.apache.org by Krishnakumar Gurumurthy <Kr...@alten.se> on 2018/02/05 08:10:20 UTC

Issues with Kafka Group Coordinator Failover

Dear All,

In our project (as part of a Kafka failover evaluation), we have a single cluster with five Kafka nodes (five partitions), three consumers (all attached to a single group), and a single ZooKeeper node. As soon as the cluster starts up, we see a leader election per partition, and each consumer discovers the group coordinator. However, when we manually shut down the Kafka service on the coordinator node, the entire cluster goes down (that is, no messages are published or consumed).

According to the Kafka wiki pages on coordinator design<https://cwiki.apache.org/confluence/display/KAFKA/Kafka+Detailed+Consumer+Coordinator+Design#KafkaDetailedConsumerCoordinatorDesign-10.OnCoordinatorFailover> and client-side assignment<https://cwiki.apache.org/confluence/display/KAFKA/Kafka+Client-side+Assignment+Proposal#KafkaClient-sideAssignmentProposal-CoordinatorStateMachine>, coordinator failover is handled within the Kafka cluster itself. Kindly let us know whether anyone in the community has encountered this before, or whether a known solution is available.

Thanks & Regards,
Krishnakumar G