You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Yun Gao (Jira)" <ji...@apache.org> on 2021/01/22 04:03:00 UTC

[jira] [Updated] (FLINK-21080) Identify JobVertex containing legacy source operators and abort checkpoint with legacy source operators partially finished

     [ https://issues.apache.org/jira/browse/FLINK-21080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yun Gao updated FLINK-21080:
----------------------------
    Component/s: Runtime / Checkpointing
                 API / DataStream

> Identify JobVertex containing legacy source operators and abort checkpoint with legacy source operators partially finished
> --------------------------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-21080
>                 URL: https://issues.apache.org/jira/browse/FLINK-21080
>             Project: Flink
>          Issue Type: Sub-task
>          Components: API / DataStream, Runtime / Checkpointing
>            Reporter: Yun Gao
>            Assignee: Guowei Ma
>            Priority: Major
>
> Most legacy source operators would record the offset for each partitions, and after recovery it would read from the recorded offset. If before a checkpoint some subtasks are finished, the corresponding partition offsets would be deserted in the checkpoint. Then if the job recover with this checkpoint, the legacy source would re-discovery all the partitions and for those finished tasks, the legacy source would re-read them since their offsets are not recorded. 
> Therefore, we would like to fail the checkpoint if some legacy source operators have part of subtasks finished. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)