You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Stephan Ewen (JIRA)" <ji...@apache.org> on 2016/08/05 18:11:20 UTC

[jira] [Created] (FLINK-4323) Checkpoint Coordinator Removes HA Checkpoints in Shutdown

Stephan Ewen created FLINK-4323:
-----------------------------------

             Summary: Checkpoint Coordinator Removes HA Checkpoints in Shutdown
                 Key: FLINK-4323
                 URL: https://issues.apache.org/jira/browse/FLINK-4323
             Project: Flink
          Issue Type: Bug
          Components: State Backends, Checkpointing
    Affects Versions: 1.1.0
            Reporter: Stephan Ewen
            Priority: Blocker
             Fix For: 1.2.0, 1.1.1


The {{CheckpointCoordinator}} has a shutdown hook that "shuts down" the savepoint store, rather than suspending it.

As a consequence, HA checkpoints may be lost when the JobManager process fails but allows the shutdown hook to run.

I would suggest to remove the sutdown hook from the CheckpointCoordinator all together. The JobManager process is responsible for cleanups and can better decide what should be cleaned up and what not.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)