You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Stephan Ewen (JIRA)" <ji...@apache.org> on 2016/08/05 18:11:20 UTC
[jira] [Created] (FLINK-4323) Checkpoint Coordinator Removes HA
Checkpoints in Shutdown
Stephan Ewen created FLINK-4323:
-----------------------------------
Summary: Checkpoint Coordinator Removes HA Checkpoints in Shutdown
Key: FLINK-4323
URL: https://issues.apache.org/jira/browse/FLINK-4323
Project: Flink
Issue Type: Bug
Components: State Backends, Checkpointing
Affects Versions: 1.1.0
Reporter: Stephan Ewen
Priority: Blocker
Fix For: 1.2.0, 1.1.1
The {{CheckpointCoordinator}} has a shutdown hook that "shuts down" the savepoint store, rather than suspending it.
As a consequence, HA checkpoints may be lost when the JobManager process fails but allows the shutdown hook to run.
I would suggest to remove the sutdown hook from the CheckpointCoordinator all together. The JobManager process is responsible for cleanups and can better decide what should be cleaned up and what not.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)