You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@kafka.apache.org by "Sönke Liebau (JIRA)" <ji...@apache.org> on 2017/11/20 05:52:00 UTC

[jira] [Commented] (KAFKA-6234) Transient failure in kafka.api.AdminClientIntegrationTest.testLogStartOffsetCheckpoint

    [ https://issues.apache.org/jira/browse/KAFKA-6234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16258826#comment-16258826 ] 

Sönke Liebau commented on KAFKA-6234:
-------------------------------------

Is the 1 second timeout in that relevant to what is being tested, or could we just try to increase this a little to fix the shaky test? Especially with stopping and restarting all brokers before that call, there may be some internal stuff going on that just needs longer than a second to complete. 
On the other hand, the waitUntilTrue call is obviously intended to solve this for a leader not available exception, but this is being interrupted by the Timeoutexception being thrown in KafkaFutureImpl.get, so this could get extended to catch this exception as well.
Since the delete request is inside the waitUntilTrue and we probably don't want to retry the entire delete just because we didn't get a low watermark returned I think increasing the timeout a bit as a first step is the way to go.

> Transient failure in kafka.api.AdminClientIntegrationTest.testLogStartOffsetCheckpoint
> --------------------------------------------------------------------------------------
>
>                 Key: KAFKA-6234
>                 URL: https://issues.apache.org/jira/browse/KAFKA-6234
>             Project: Kafka
>          Issue Type: Bug
>            Reporter: Guozhang Wang
>
> Saw this once: https://builds.apache.org/job/kafka-pr-jdk9-scala2.12/2669/testReport/junit/kafka.api/AdminClientIntegrationTest/testLogStartOffsetCheckpoint/
> {code}
> Stacktrace
> java.util.concurrent.TimeoutException
> 	at org.apache.kafka.common.internals.KafkaFutureImpl$SingleWaiter.await(KafkaFutureImpl.java:108)
> 	at org.apache.kafka.common.internals.KafkaFutureImpl.get(KafkaFutureImpl.java:225)
> 	at kafka.api.AdminClientIntegrationTest.$anonfun$testLogStartOffsetCheckpoint$3(AdminClientIntegrationTest.scala:762)
> 	at kafka.utils.TestUtils$.waitUntilTrue(TestUtils.scala:858)
> 	at kafka.api.AdminClientIntegrationTest.testLogStartOffsetCheckpoint(AdminClientIntegrationTest.scala:756)
> 	at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 	at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 	at java.base/java.lang.reflect.Method.invoke(Method.java:564)
> 	at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
> 	at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
> 	at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
> 	at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
> 	at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
> 	at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
> 	at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:298)
> 	at org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:292)
> 	at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
> 	at java.base/java.lang.Thread.run(Thread.java:844)
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)