You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Ravi Bhushan Ratnakar (Jira)" <ji...@apache.org> on 2020/07/22 16:39:00 UTC

[jira] [Commented] (FLINK-18675) Checkpoint not maintaining minimum pause duration between checkpoints

    [ https://issues.apache.org/jira/browse/FLINK-18675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17162932#comment-17162932 ] 

Ravi Bhushan Ratnakar commented on FLINK-18675:
-----------------------------------------------

As per my understanding of the code, in the CheckpointCordinator class, at line number [1512|[https://github.com/apache/flink/blob/release-1.11.0/flink-runtime/src/main/java/org/apache/flink/runtime/checkpoint/CheckpointCoordinator.java#L1512]] ,scheduleAtFixedRate method is being used. I think that we should use "[scheduleWithFixedDelay|[https://docs.oracle.com/javase/8/docs/api/java/util/concurrent/ScheduledExecutorService.html#scheduleWithFixedDelay-java.lang.Runnable-long-long-java.util.concurrent.TimeUnit-]]{{"}}

> Checkpoint not maintaining minimum pause duration between checkpoints
> ---------------------------------------------------------------------
>
>                 Key: FLINK-18675
>                 URL: https://issues.apache.org/jira/browse/FLINK-18675
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Checkpointing
>    Affects Versions: 1.11.0
>         Environment: !image.png!
>            Reporter: Ravi Bhushan Ratnakar
>            Priority: Critical
>         Attachments: image.png
>
>
> I am running a streaming job with Flink 1.11.0 using kubernetes infrastructure. I have configured checkpoint configuration like below
> Interval - 3 minutes
> Minimum pause between checkpoints - 3 minutes
> Checkpoint timeout - 10 minutes
> Checkpointing Mode - Exactly Once
> Number of Concurrent Checkpoint - 1
>  
> Other configs
> Time Characteristics - Processing Time
>  
> I am observing an usual behaviour. *When a checkpoint completes successfully* *and if it's end to end duration is almost equal or greater than Minimum pause duration then the next checkpoint gets triggered immediately without maintaining the Minimum pause duration*. Kindly notice this behaviour from checkpoint id 194 onward in the attached screenshot



--
This message was sent by Atlassian Jira
(v8.3.4#803005)