Posted to issues@flink.apache.org by "Ufuk Celebi (JIRA)" <ji...@apache.org> on 2016/07/15 16:25:20 UTC

[jira] [Commented] (FLINK-4201) Checkpoints for jobs in non-terminal state (e.g. suspended) get deleted

    [ https://issues.apache.org/jira/browse/FLINK-4201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15379639#comment-15379639 ] 

Ufuk Celebi commented on FLINK-4201:
------------------------------------

The shutdown hook is actually not the problem, because it is only active in standalone recovery mode. The real issue is that a suspended execution graph shuts down the checkpoint coordinator, which discards all checkpoints on shutdown. We still need to call shutdown in order to free resources such as the timer task, but we have to skip discarding the checkpoints when the execution graph is suspended, i.e. not in a globally terminal state.
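A minimal sketch of the proposed behaviour, to make the distinction concrete. This is not Flink's actual code: `CheckpointShutdownSketch`, `SketchCoordinator`, its `shutdown(JobStatus)` signature, and the simplified `JobStatus` enum are hypothetical stand-ins, and checkpoints are modelled as plain strings. The point is only that shutdown always releases resources (here, a flag standing in for the timer task) but discards checkpoints only in a globally terminal state.

```java
import java.util.ArrayList;
import java.util.List;

public class CheckpointShutdownSketch {

    /** Simplified stand-in for a job status; an assumption, not Flink's enum. */
    enum JobStatus {
        RUNNING(false), SUSPENDED(false), FINISHED(true), CANCELED(true), FAILED(true);

        private final boolean globallyTerminal;

        JobStatus(boolean globallyTerminal) {
            this.globallyTerminal = globallyTerminal;
        }

        boolean isGloballyTerminal() {
            return globallyTerminal;
        }
    }

    /** Hypothetical coordinator that tracks checkpoints and a timer flag. */
    static class SketchCoordinator {
        private final List<String> checkpoints = new ArrayList<>();
        private boolean timerRunning = true;

        void addCheckpoint(String handle) {
            checkpoints.add(handle);
        }

        /** Always frees resources; discards checkpoints only if the job is done for good. */
        void shutdown(JobStatus status) {
            // Stand-in for cancelling the periodic checkpoint timer task.
            timerRunning = false;

            // A suspended job may be recovered later, so its checkpoints must survive.
            if (status.isGloballyTerminal()) {
                checkpoints.clear();
            }
        }

        List<String> retainedCheckpoints() {
            return new ArrayList<>(checkpoints);
        }

        boolean isTimerRunning() {
            return timerRunning;
        }
    }

    public static void main(String[] args) {
        SketchCoordinator suspended = new SketchCoordinator();
        suspended.addCheckpoint("chk-1");
        suspended.shutdown(JobStatus.SUSPENDED);
        System.out.println("suspended keeps: " + suspended.retainedCheckpoints());

        SketchCoordinator finished = new SketchCoordinator();
        finished.addCheckpoint("chk-1");
        finished.shutdown(JobStatus.FINISHED);
        System.out.println("finished keeps: " + finished.retainedCheckpoints());
    }
}
```

In this sketch the status is passed into the shutdown call, so the caller (the execution graph, in the scenario above) decides whether the checkpoints are still needed; the coordinator itself stays unaware of why it is being shut down.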

> Checkpoints for jobs in non-terminal state (e.g. suspended) get deleted
> -----------------------------------------------------------------------
>
>                 Key: FLINK-4201
>                 URL: https://issues.apache.org/jira/browse/FLINK-4201
>             Project: Flink
>          Issue Type: Bug
>          Components: State Backends, Checkpointing
>            Reporter: Stefan Richter
>            Assignee: Ufuk Celebi
>            Priority: Blocker
>
> For example, when shutting down a YARN session, the logs show that checkpoints for jobs that did not terminate are deleted. In the shutdown hook, removeAllCheckpoints is called and removes checkpoints that should still be kept.


