You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Yun Gao (Jira)" <ji...@apache.org> on 2021/01/22 04:03:00 UTC
[jira] [Updated] (FLINK-21080) Identify JobVertex containing legacy
source operators and abort checkpoint with legacy source operators
partially finished
[ https://issues.apache.org/jira/browse/FLINK-21080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yun Gao updated FLINK-21080:
----------------------------
Component/s: Runtime / Checkpointing
API / DataStream
> Identify JobVertex containing legacy source operators and abort checkpoint with legacy source operators partially finished
> --------------------------------------------------------------------------------------------------------------------------
>
> Key: FLINK-21080
> URL: https://issues.apache.org/jira/browse/FLINK-21080
> Project: Flink
> Issue Type: Sub-task
> Components: API / DataStream, Runtime / Checkpointing
> Reporter: Yun Gao
> Assignee: Guowei Ma
> Priority: Major
>
> Most legacy source operators would record the offset for each partitions, and after recovery it would read from the recorded offset. If before a checkpoint some subtasks are finished, the corresponding partition offsets would be deserted in the checkpoint. Then if the job recover with this checkpoint, the legacy source would re-discovery all the partitions and for those finished tasks, the legacy source would re-read them since their offsets are not recorded.
> Therefore, we would like to fail the checkpoint if some legacy source operators have part of subtasks finished.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)