Posted to issues@flink.apache.org by "Matthias Pohl (Jira)" <ji...@apache.org> on 2022/04/22 11:13:00 UTC
[jira] [Created] (FLINK-27355) JobManagerRunnerRegistry.localCleanupAsync does not call the JobManagerRunner.close method repeatedly
Matthias Pohl created FLINK-27355:
-------------------------------------
Summary: JobManagerRunnerRegistry.localCleanupAsync does not call the JobManagerRunner.close method repeatedly
Key: FLINK-27355
URL: https://issues.apache.org/jira/browse/FLINK-27355
Project: Flink
Issue Type: Bug
Components: Runtime / Coordination
Affects Versions: 1.15.0
Reporter: Matthias Pohl
The {{DefaultJobManagerRunnerRegistry.localCleanupAsync}} method deregisters the JobManagerRunner and then calls close on it. If close fails for whatever reason, the failure is detected and a retry is triggered, but the retry only finds that the JobManagerRunner is already deregistered and does nothing.
Hence, the JobMaster shutdown won't be retriggered (i.e. errors in the {{CompletedCheckpointStore}} or the {{CheckpointIDCounter}} won't be handled). FLINK-26114 is related: neither component exposes any errors right now anyway.
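To illustrate the behaviour described above, here is a minimal, self-contained sketch. It does not use Flink's actual classes: {{CleanupRetrySketch}}, {{Registry}}, {{Runner}} and {{FlakyRunner}} are hypothetical stand-ins, and the deregister-then-close order is an assumption taken from the description in this ticket.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.atomic.AtomicInteger;

public class CleanupRetrySketch {

    /** Hypothetical stand-in for a runner whose close can fail. */
    interface Runner {
        CompletableFuture<Void> closeAsync();
    }

    /** Hypothetical registry mirroring the behaviour described in this issue. */
    static class Registry {
        private final Map<String, Runner> runners = new HashMap<>();

        void register(String jobId, Runner runner) {
            runners.put(jobId, runner);
        }

        boolean isRegistered(String jobId) {
            return runners.containsKey(jobId);
        }

        // Deregisters first, then closes. If close fails, the runner is
        // already gone from the map, so a retry has nothing left to close.
        CompletableFuture<Void> localCleanupAsync(String jobId) {
            Runner runner = runners.remove(jobId);
            if (runner == null) {
                // Retry path: nothing registered, so report success.
                return CompletableFuture.completedFuture(null);
            }
            return runner.closeAsync();
        }
    }

    /** Runner that fails on its first close and would succeed on a second. */
    static class FlakyRunner implements Runner {
        final AtomicInteger closeCalls = new AtomicInteger();

        @Override
        public CompletableFuture<Void> closeAsync() {
            if (closeCalls.incrementAndGet() == 1) {
                CompletableFuture<Void> failed = new CompletableFuture<>();
                failed.completeExceptionally(new IllegalStateException("close failed"));
                return failed;
            }
            return CompletableFuture.completedFuture(null);
        }
    }

    public static void main(String[] args) {
        Registry registry = new Registry();
        FlakyRunner runner = new FlakyRunner();
        registry.register("job-1", runner);

        CompletableFuture<Void> first = registry.localCleanupAsync("job-1");
        CompletableFuture<Void> retry = registry.localCleanupAsync("job-1");

        System.out.println("first attempt failed: " + first.isCompletedExceptionally());
        System.out.println("retry failed: " + retry.isCompletedExceptionally());
        System.out.println("close calls: " + runner.closeCalls.get());
    }
}
```

In this sketch the first attempt fails, the retry reports success, and close is called only once, so the failed shutdown is never re-attempted even though a second close would have succeeded.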
--
This message was sent by Atlassian Jira
(v8.20.7#820007)