You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Jun Rao (Jira)" <ji...@apache.org> on 2020/07/08 20:22:00 UTC

[jira] [Resolved] (KAFKA-10225) Increase default zk session timeout for system tests

     [ https://issues.apache.org/jira/browse/KAFKA-10225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jun Rao resolved KAFKA-10225.
-----------------------------
    Fix Version/s: 2.7.0
       Resolution: Fixed

merged the PR to trunk

> Increase default zk session timeout for system tests
> ----------------------------------------------------
>
>                 Key: KAFKA-10225
>                 URL: https://issues.apache.org/jira/browse/KAFKA-10225
>             Project: Kafka
>          Issue Type: Improvement
>            Reporter: Chia-Ping Tsai
>            Assignee: Chia-Ping Tsai
>            Priority: Minor
>             Fix For: 2.7.0
>
>
> I'm digging in the flaky system tests and then I noticed there are many flaky caused by following check.
> {code}
>         with node.account.monitor_log(KafkaService.STDOUT_STDERR_CAPTURE) as monitor:
>             node.account.ssh(cmd)
>             # Kafka 1.0.0 and higher don't have a space between "Kafka" and "Server"
>             monitor.wait_until("Kafka\s*Server.*started", timeout_sec=timeout_sec, backoff_sec=.25,
>                                err_msg="Kafka server didn't finish startup in %d seconds" % timeout_sec)
> {code}
> And the error message in broker log is shown below.
> {quote}
> kafka.zookeeper.ZooKeeperClientTimeoutException: Timed out waiting for connection while in state: CONNECTING
> 	at kafka.zookeeper.ZooKeeperClient.waitUntilConnected(ZooKeeperClient.scala:262)
> 	at kafka.zookeeper.ZooKeeperClient.<init>(ZooKeeperClient.scala:119)
> 	at kafka.zk.KafkaZkClient$.apply(KafkaZkClient.scala:1880)
> 	at kafka.server.KafkaServer.createZkClient$1(KafkaServer.scala:430)
> 	at kafka.server.KafkaServer.initZkClient(KafkaServer.scala:455)
> 	at kafka.server.KafkaServer.startup(KafkaServer.scala:227)
> 	at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:44)
> 	at kafka.Kafka$.main(Kafka.scala:82)
> 	at kafka.Kafka.main(Kafka.scala)
> {quote}
> I'm surprised the default timeout of zk connection in system test is only 2 seconds as the default timeout in production is increased to 18s (see https://github.com/apache/kafka/commit/4bde9bb3ccaf5571be76cb96ea051dadaeeaf5c7)
> {code}
> config_property.ZOOKEEPER_CONNECTION_TIMEOUT_MS: 2000
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)