Posted to commits@samza.apache.org by "Martin Kleppmann (JIRA)" <ji...@apache.org> on 2014/04/05 11:16:15 UTC

[jira] [Commented] (SAMZA-224) TestStatefulTask.testShouldStartAndRestore fails intermittently

    [ https://issues.apache.org/jira/browse/SAMZA-224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13961021#comment-13961021 ] 

Martin Kleppmann commented on SAMZA-224:
----------------------------------------

This doesn't seem to be purely random: with the same code, the test sometimes fails for several runs in a row and then succeeds for several runs in a row. Last night I saw it fail 4 times in a row, then succeed 10 times in a row. Perhaps some external environmental condition (e.g. other processes on the machine using the network?) affects whether it occurs, but I've not yet found a clear correlation with any particular factor.
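
As a rough illustration of why streaks like this look non-random, the sketch below collapses a pass/fail history into runs and compares the number of runs with what a random shuffle of the same outcomes would give on average. The function names are hypothetical, and the only data used is the 4 failures followed by 10 successes mentioned above; it is a back-of-the-envelope check, not a rigorous statistical test.

```python
from itertools import groupby


def run_lengths(outcomes):
    """Collapse a sequence of outcomes into (outcome, run length) pairs."""
    return [(outcome, len(list(group))) for outcome, group in groupby(outcomes)]


def expected_runs(n_fail, n_pass):
    """Average number of runs if the same outcomes were ordered at random."""
    return 1 + 2 * n_fail * n_pass / (n_fail + n_pass)


# Observed history: 4 failures in a row, then 10 successes in a row.
outcomes = ["FAIL"] * 4 + ["PASS"] * 10
runs = run_lengths(outcomes)

print("observed runs:", runs)
print("number of observed runs:", len(runs))
print("expected runs if order were random:", round(expected_runs(4, 10), 2))
```

Far fewer runs than the random-order average hints at clustering, which fits the idea of an external condition that persists across consecutive test runs. With only 14 samples this is suggestive at best.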

I've tried reducing the timeout from 60 seconds to 20 seconds, but that doesn't seem to make the failure any more frequent. And during the phase when it was failing 4 times in a row, I set the timeout to 300 seconds for one run, and it still failed. That suggests it's not just a boundary condition where something is sometimes slower than the 60-second threshold. It must be something else.
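
The reasoning about the timeout can be sketched with a generic polling helper. This is not the code in TestStatefulTask; `wait_until`, `ready_after`, and the timings are hypothetical and scaled down to fractions of a second. The point is only that a slow-but-working condition is sensitive to the timeout, whereas a condition that never becomes true fails regardless of how long one waits.

```python
import time


def wait_until(condition, timeout_s, poll_interval_s=0.01):
    """Poll condition() until it returns True or timeout_s has elapsed."""
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        if condition():
            return True
        time.sleep(poll_interval_s)
    return False


def ready_after(n_polls):
    """Hypothetical slow-but-working system: becomes ready on the n-th poll."""
    calls = {"count": 0}

    def condition():
        calls["count"] += 1
        return calls["count"] >= n_polls

    return condition


def never_ready():
    """Hypothetical stuck system: the awaited event never arrives."""
    return False


# Slow system: the outcome depends on the timeout (a boundary condition).
print("slow, short timeout:", wait_until(ready_after(10), timeout_s=0.03))
print("slow, long timeout: ", wait_until(ready_after(10), timeout_s=2.0))

# Stuck system: the outcome is the same whatever the timeout.
print("stuck, short timeout:", wait_until(never_ready, timeout_s=0.03))
print("stuck, long timeout: ", wait_until(never_ready, timeout_s=0.2))
```

A failure that persists at 300 seconds and does not become more frequent at 20 seconds resembles the stuck case rather than the slow case.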

> TestStatefulTask.testShouldStartAndRestore fails intermittently
> ---------------------------------------------------------------
>
>                 Key: SAMZA-224
>                 URL: https://issues.apache.org/jira/browse/SAMZA-224
>             Project: Samza
>          Issue Type: Bug
>            Reporter: Martin Kleppmann
>
> [~closeuris] reported the following issue on SAMZA-185. I'm now intermittently seeing this problem too, so I'm opening a new issue for it (as I'm not sure the issue has anything to do with the Kafka 0.8.1 upgrade).
> {noformat}
> org.apache.samza.test.integration.TestStatefulTask > testShouldStartAndRestore FAILED
>     java.lang.AssertionError at TestStatefulTask.scala:356
> 2 tests completed, 1 failed
> :samza-test_2.10:test FAILED
> {noformat}
> Logs provided by Yan (mine look similar):
> Failed tests: http://pastebin.com/3nEDw9jC
> Standard output: http://pastebin.com/20vZbk7c
> Standard error: http://pastebin.com/gU6cTNAr


