You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Zhijiang (Jira)" <ji...@apache.org> on 2020/06/02 09:08:00 UTC

[jira] [Created] (FLINK-18063) Fix the race condition for aborting current checkpoint in CheckpointBarrierUnaligner#processEndOfPartition

Zhijiang created FLINK-18063:
--------------------------------

             Summary: Fix the race condition for aborting current checkpoint in CheckpointBarrierUnaligner#processEndOfPartition
                 Key: FLINK-18063
                 URL: https://issues.apache.org/jira/browse/FLINK-18063
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Checkpointing
    Affects Versions: 1.11.0
            Reporter: Zhijiang
            Assignee: Zhijiang
             Fix For: 1.11.0, 1.12.0


In the handle of CheckpointBarrierUnaligner#processEndOfPartition, it only aborts the current checkpoint by judging the condition of pending checkpoint from task thread processing, so it will miss one scenario that checkpoint triggered by notifyBarrierReceived from netty thread.

The proper fix should also judge the pending checkpoint inside ThreadSafeUnaligner in order to abort it and reset internal variables in case.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)