You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/04/28 12:40:04 UTC
[jira] [Commented] (KAFKA-3761) Controller has RunningAsBroker
instead of RunningAsController state
[ https://issues.apache.org/jira/browse/KAFKA-3761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15988745#comment-15988745 ]
ASF GitHub Bot commented on KAFKA-3761:
---------------------------------------
GitHub user ijuma opened a pull request:
https://github.com/apache/kafka/pull/2935
MINOR: onControllerResignation should be invoked if triggerControllerMove is called
This fixes a transient test failure due to a NPE in ControllerFailoverTest.testMetadataUpdate:
```text
Caused by: java.lang.NullPointerException
at kafka.controller.ControllerBrokerRequestBatch.addUpdateMetadataRequestForBrokers(ControllerChannelManager.scala:338)
at kafka.controller.KafkaController.sendUpdateMetadataRequest(KafkaController.scala:975)
at kafka.controller.ControllerFailoverTest.testMetadataUpdate(ControllerFailoverTest.scala:141)
```
The underlying issue is that setting `activeControllerId.set(-1)` in `triggerControllerMove`
causes `Reelect` not to invoke `onControllerResignation`. Among other things, this
causes an IllegalStateException to be thrown when `KafkaScheduler.startup` is invoked
for the second time without the corresponding `shutdown`.
I also updated the test so that we can trigger this issue deterministically instead of
transiently.
Finally, I included a few clean-ups:
1. No longer update the broker state in `onControllerFailover`. This is no longer needed
since we removed the `RunningAsController` state (KAFKA-3761).
2. Trivial clean-ups in KafkaController
3. Removed unused parameter in `ZkUtils.getPartitionLeaderAndIsrForTopics`
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/ijuma/kafka on-controller-resignation-if-trigger-controller-move
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/kafka/pull/2935.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #2935
----
commit c01d29b3a95e7ddffb91550397a5505d9711d5c8
Author: Ismael Juma <is...@juma.me.uk>
Date: 2017-04-28T12:30:03Z
MINOR: onControllerResignation should be invoked if triggerControllerMove is called
commit f28de697f7d109893537652e8b8216c4d06677a7
Author: Ismael Juma <is...@juma.me.uk>
Date: 2017-04-28T12:30:47Z
Remove remnant broker state update in `onControllerFailover`
commit 898b88b59cffbfdb7df864d0b070ed7a4960601e
Author: Ismael Juma <is...@juma.me.uk>
Date: 2017-04-28T12:31:15Z
A few trivial clean-ups in KafkaController
commit 241b9890ab47b4670e61f3f9d3b51c6aa92a8a94
Author: Ismael Juma <is...@juma.me.uk>
Date: 2017-04-28T12:31:38Z
Remove unused parameter in `getPartitionLeaderAndIsrForTopics`
----
> Controller has RunningAsBroker instead of RunningAsController state
> -------------------------------------------------------------------
>
> Key: KAFKA-3761
> URL: https://issues.apache.org/jira/browse/KAFKA-3761
> Project: Kafka
> Issue Type: Bug
> Reporter: Ismael Juma
> Assignee: Roger Hoover
> Fix For: 0.10.1.0
>
>
> In `KafkaServer.start`, we start `KafkaController`:
> {code}
> /* start kafka controller */
> kafkaController = new KafkaController(config, zkUtils, brokerState, kafkaMetricsTime, metrics, threadNamePrefix)
> kafkaController.startup()
> {code}
> Which sets the state to `RunningAsController` in `KafkaController.onControllerFailover`:
> `brokerState.newState(RunningAsController)`
> And this later gets set to `RunningAsBroker`.
> This doesn't match the diagram in `BrokerStates`. [~junrao] suggested that we should start the controller after we register the broker in ZK, but this seems tricky as we need to controller in `KafkaApis`.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)