You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Congxian Qiu(klion26) (JIRA)" <ji...@apache.org> on 2019/05/24 12:44:00 UTC

[jira] [Created] (FLINK-12619) Support TERMINATE/SUSPEND Job with Checkpoint

Congxian Qiu(klion26) created FLINK-12619:
---------------------------------------------

             Summary: Support TERMINATE/SUSPEND Job with Checkpoint
                 Key: FLINK-12619
                 URL: https://issues.apache.org/jira/browse/FLINK-12619
             Project: Flink
          Issue Type: New Feature
          Components: Runtime / State Backends
            Reporter: Congxian Qiu(klion26)
            Assignee: Congxian Qiu(klion26)


Inspired by the idea of FLINK-11458, we propose to support terminate/suspend a job with checkpoint. This improvement cooperates with incremental and external checkpoint features, that if checkpoint is retained and this feature is configured, we will trigger a checkpoint before the job stops. It could accelarate job recovery a lot since:
1. No source rewinding required any more.
2. It's much faster than taking a savepoint since incremental checkpoint is enabled.

Please note that conceptually savepoints is different from checkpoint in a similar way that backups are different from recovery logs in traditional database systems. So we suggest using this feature only for job recovery, while stick with FLINK-11458 for the upgrading/cross-cluster-job-migration/state-backend-switch cases.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)