Posted to issues@flink.apache.org by "Piotr Nowojski (JIRA)" <ji...@apache.org> on 2018/05/25 12:05:00 UTC
[jira] [Updated] (FLINK-9439) DispatcherTest#testJobRecovery dead locks
[ https://issues.apache.org/jira/browse/FLINK-9439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Piotr Nowojski updated FLINK-9439:
----------------------------------
Description:
DispatcherTest#testJobRecovery occasionally deadlocks on master. Example deadlock from my pull request:
[https://api.travis-ci.org/v3/job/383147736/log.txt]
Afterwards, I stripped down `flink-runtime` and ran this test in a loop on Travis, and it also deadlocks on master:
[https://github.com/pnowojski/flink/commits/loop-runtime-master]
(Note that the looped version sometimes also fails with an exception from setUp: {{ akka.actor.InvalidActorNameException: actor name [dispatcher_testJobRecovery] is not unique! }}, but this might be unrelated.)
Example failed build:
[https://travis-ci.org/pnowojski/flink/builds/383650106]
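For anyone who wants to reproduce a rare hang like this locally rather than on Travis, the following is a minimal sketch of rerunning a test body under a per-iteration timeout, so that a deadlock shows up as a reported iteration number instead of a stuck build. The class `DeadlockLoop` and its method `firstHangingIteration` are hypothetical names introduced here for illustration; they are not part of Flink and are not the setup used in the branch linked above.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

/** Hypothetical helper (not part of Flink): reruns a test body until one run hangs. */
final class DeadlockLoop {

    private DeadlockLoop() {}

    /**
     * Runs {@code body} up to {@code iterations} times, each on a fresh daemon thread.
     *
     * @return the zero-based index of the first iteration that did not finish within
     *         {@code timeoutMillis}, or -1 if every iteration completed in time
     * @throws Exception if the body itself fails (wrapped in an ExecutionException)
     */
    static int firstHangingIteration(Callable<Void> body, int iterations, long timeoutMillis)
            throws Exception {
        for (int i = 0; i < iterations; i++) {
            // A daemon thread ensures a truly deadlocked run cannot keep the JVM alive.
            ExecutorService executor = Executors.newSingleThreadExecutor(runnable -> {
                Thread worker = new Thread(runnable, "looped-test");
                worker.setDaemon(true);
                return worker;
            });
            try {
                Future<Void> run = executor.submit(body);
                // Wait for this iteration, but give up once the timeout elapses.
                run.get(timeoutMillis, TimeUnit.MILLISECONDS);
            } catch (TimeoutException e) {
                // The run did not finish in time: treat it as a suspected deadlock.
                return i;
            } finally {
                // Interrupts the worker if it is still running; a real deadlock may ignore this.
                executor.shutdownNow();
            }
        }
        return -1;
    }
}
```

A caller would pass the test logic as the `Callable` and choose a timeout well above the normal run time of the test. The timeout only signals a suspected hang; a thread dump of the stuck run is still needed to confirm that it is a deadlock.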
> DispatcherTest#testJobRecovery dead locks
> -----------------------------------------
>
> Key: FLINK-9439
> URL: https://issues.apache.org/jira/browse/FLINK-9439
> Project: Flink
> Issue Type: Bug
> Components: Distributed Coordination
> Affects Versions: 1.5.0
> Reporter: Piotr Nowojski
> Priority: Blocker
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)