You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Yanfei Lei (Jira)" <ji...@apache.org> on 2022/04/13 10:35:00 UTC
[jira] [Commented] (FLINK-25872) Restoring from non-changelog checkpoint with changelog state-backend enabled in CLAIM mode discards state in use
[ https://issues.apache.org/jira/browse/FLINK-25872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521607#comment-17521607 ]
Yanfei Lei commented on FLINK-25872:
------------------------------------
hi [~roman], I wrote a document([https://docs.google.com/document/d/1KSFWc0gL7HkhC-JNrnsp06TLnsTmZOTHITQDcGMo0cI/edit?usp=sharing|https://docs.google.com/document/d/1KSFWc0gL7HkhC-JNrnsp06TLnsTmZOTHITQDcGMo0cI/edit?usp=sharing,]) about this ticket[,|https://docs.google.com/document/d/1KSFWc0gL7HkhC-JNrnsp06TLnsTmZOTHITQDcGMo0cI/edit?usp=sharing,] would you please take a review and give some advice?
{quote}I think registering *all* {{{}KeyedStateHandle{}}}s with the {{SharedStateRegistry}} on recovery in {{CLAIM}} mode would also solve the problem, wouldn't it?
The advantage is that JM wouldn't have to know anything about the changelog.
I think this is important and that's why I'd prefer such an approach.
{quote}
I think only `{{{}registering *all* {{{}KeyedStateHandle{}}}s with the {{SharedStateRegistry}}{}}}` may not work as well, because the {{discardState()}} of KeyedStateHandles are {*}not empty{*}, although all {{KeyedStateHandles}} are registered to {{{}SharedStateRegistry{}}}, the state would be discarded on checkpoint subsuming(maybe I overlooked something)?
> Restoring from non-changelog checkpoint with changelog state-backend enabled in CLAIM mode discards state in use
> ----------------------------------------------------------------------------------------------------------------
>
> Key: FLINK-25872
> URL: https://issues.apache.org/jira/browse/FLINK-25872
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Checkpointing, Runtime / State Backends
> Reporter: Yun Tang
> Assignee: Yanfei Lei
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.16.0
>
>
> If we restore from checkpoint with changelog state-backend enabled in snapshot CLAIM mode, the restored checkpoint would be discarded on subsume. This invalidates newer/active checkpoints because their materialized part is discarded (for incremental wrapped checkpoints, their private state is discarded). This bug is like FLINK-25478.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)