You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Till Rohrmann (JIRA)" <ji...@apache.org> on 2019/01/21 15:54:00 UTC
[jira] [Created] (FLINK-11400) JobManagerRunner does not wait for
suspension of JobMaster
Till Rohrmann created FLINK-11400:
-------------------------------------
Summary: JobManagerRunner does not wait for suspension of JobMaster
Key: FLINK-11400
URL: https://issues.apache.org/jira/browse/FLINK-11400
Project: Flink
Issue Type: Bug
Components: Distributed Coordination
Affects Versions: 1.7.1, 1.6.3, 1.8.0
Reporter: Till Rohrmann
Assignee: Till Rohrmann
Fix For: 1.8.0
The {{JobManagerRunner}} does not wait for the suspension of the {{JobMaster}} to finish before granting leadership again. This can lead to a state where the {{JobMaster}} tries to start the {{ExecutionGraph}} but the {{SlotPool}} is still stopped.
I suggest to linearize the leadership operations (granting and revoking leadership) similarly to the {{Dispatcher}}.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)