You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Arvid Heise (Jira)" <ji...@apache.org> on 2021/04/07 09:28:00 UTC

[jira] [Created] (FLINK-22137) Execute unaligned checkpoint test on a cluster

Arvid Heise created FLINK-22137:
-----------------------------------

             Summary: Execute unaligned checkpoint test on a cluster
                 Key: FLINK-22137
                 URL: https://issues.apache.org/jira/browse/FLINK-22137
             Project: Flink
          Issue Type: Sub-task
            Reporter: Arvid Heise


Start application and at some point cancel/induce failure, the user needs to restart from a retained checkpoint with

*     lower
*     same
*     higher degree of parallelism.

To enable unaligned checkpoints, set

*     execution.checkpointing.unaligned: true
*     execution.checkpointing.alignment-timeout to 0s, 10s, 1min (for high backpressure)

The primary objective is to check if all data is recovered properly and if the semantics is correct (does state match input?).

The secondary objective is to check if Flink UI shows the information correctly:

*     unaligned checkpoint enabled on job level
*     timeout on job level
*     for each checkpoint, if it's unaligned or not; how much data was written



--
This message was sent by Atlassian Jira
(v8.3.4#803005)