Posted to issues@flink.apache.org by "Aljoscha Krettek (JIRA)" <ji...@apache.org> on 2016/04/29 13:50:12 UTC

[jira] [Commented] (FLINK-3844) Checkpoint failures should not always lead to job failures

    [ https://issues.apache.org/jira/browse/FLINK-3844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15263946#comment-15263946 ] 

Aljoscha Krettek commented on FLINK-3844:
-----------------------------------------

+1. We could have something similar to {{RestartStrategy}}, but for checkpoints, that determines when failed checkpoints should fail the job.
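
A rough sketch of what such a strategy could look like, only to make the idea concrete. The names {{CheckpointFailureStrategy}} and {{TolerateConsecutiveFailures}} are hypothetical and are not part of Flink's API; the sketch assumes the checkpoint coordinator would notify the strategy after every completed or failed checkpoint.

```java
// Hypothetical sketch: none of these types exist in Flink.
// Assumption: the caller invokes one of the two methods after every checkpoint attempt.
interface CheckpointFailureStrategy {

    /** Records a failed checkpoint and returns true if the job should now be failed. */
    boolean onCheckpointFailure();

    /** Records a successfully completed checkpoint. */
    void onCheckpointSuccess();
}

/** Fails the job only once more than a fixed number of checkpoints have failed in a row. */
final class TolerateConsecutiveFailures implements CheckpointFailureStrategy {

    private final int maxConsecutiveFailures;
    private int consecutiveFailures = 0;

    TolerateConsecutiveFailures(int maxConsecutiveFailures) {
        if (maxConsecutiveFailures < 0) {
            throw new IllegalArgumentException("maxConsecutiveFailures must be >= 0");
        }
        this.maxConsecutiveFailures = maxConsecutiveFailures;
    }

    @Override
    public boolean onCheckpointFailure() {
        consecutiveFailures++;
        // Tolerate up to maxConsecutiveFailures failures in a row; the next one fails the job.
        return consecutiveFailures > maxConsecutiveFailures;
    }

    @Override
    public void onCheckpointSuccess() {
        // Any completed checkpoint resets the streak.
        consecutiveFailures = 0;
    }
}
```

With {{new TolerateConsecutiveFailures(0)}} the first failed checkpoint fails the job, which matches the current behaviour, so that could serve as the default. A larger value would log and tolerate that many consecutive failures before giving up.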

> Checkpoint failures should not always lead to job failures
> ----------------------------------------------------------
>
>                 Key: FLINK-3844
>                 URL: https://issues.apache.org/jira/browse/FLINK-3844
>             Project: Flink
>          Issue Type: Improvement
>          Components: Streaming
>            Reporter: Gyula Fora
>
> Currently, when a checkpoint fails, the job crashes immediately. This is not the desired behaviour in many cases. It would probably be better to log the failed checkpoint attempt and only fail the job after a certain number of consecutive failed attempts.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)