You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Yanfei Lei (Jira)" <ji...@apache.org> on 2022/04/13 10:35:00 UTC

[jira] [Commented] (FLINK-25872) Restoring from non-changelog checkpoint with changelog state-backend enabled in CLAIM mode discards state in use

    [ https://issues.apache.org/jira/browse/FLINK-25872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521607#comment-17521607 ] 

Yanfei Lei commented on FLINK-25872:
------------------------------------

hi [~roman], I wrote a document([https://docs.google.com/document/d/1KSFWc0gL7HkhC-JNrnsp06TLnsTmZOTHITQDcGMo0cI/edit?usp=sharing|https://docs.google.com/document/d/1KSFWc0gL7HkhC-JNrnsp06TLnsTmZOTHITQDcGMo0cI/edit?usp=sharing,]) about this ticket[,|https://docs.google.com/document/d/1KSFWc0gL7HkhC-JNrnsp06TLnsTmZOTHITQDcGMo0cI/edit?usp=sharing,] would you please take a review and give some advice?  

 
{quote}I think registering *all* {{{}KeyedStateHandle{}}}s with the {{SharedStateRegistry}} on recovery in {{CLAIM}} mode would also solve the problem, wouldn't it?

The advantage is that JM wouldn't have to know anything about the changelog.
I think this is important and that's why I'd prefer such an approach.
{quote}
I think only `{{{}registering *all* {{{}KeyedStateHandle{}}}s with the {{SharedStateRegistry}}{}}}` may not work as well, because the {{discardState()}} of KeyedStateHandles are {*}not empty{*}, although all {{KeyedStateHandles}} are registered to {{{}SharedStateRegistry{}}}, the state would be discarded on checkpoint subsuming(maybe I overlooked something)? 

> Restoring from non-changelog checkpoint with changelog state-backend enabled in CLAIM mode discards state in use
> ----------------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-25872
>                 URL: https://issues.apache.org/jira/browse/FLINK-25872
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Checkpointing, Runtime / State Backends
>            Reporter: Yun Tang
>            Assignee: Yanfei Lei
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.16.0
>
>
> If we restore from checkpoint with changelog state-backend enabled in snapshot CLAIM mode, the restored checkpoint would be discarded on subsume. This invalidates newer/active checkpoints because their materialized part is discarded (for incremental wrapped checkpoints, their private state is discarded). This bug is like FLINK-25478.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)