You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Yun Tang (Jira)" <ji...@apache.org> on 2021/12/29 13:01:00 UTC
[jira] [Created] (FLINK-25478) Changelog materialization with incremental checkpoint could cause checkpointed data lost
Yun Tang created FLINK-25478:
--------------------------------
Summary: Changelog materialization with incremental checkpoint could cause checkpointed data lost
Key: FLINK-25478
URL: https://issues.apache.org/jira/browse/FLINK-25478
Project: Flink
Issue Type: Bug
Components: Runtime / Checkpointing, Runtime / State Backends
Reporter: Yun Tang
Fix For: 1.15.0
Currently, changelog materialization would call RocksDB state backend's snapshot method to generate {{IncrementalRemoteKeyedStateHandle}} as ChangelogStateBackendHandleImpl's materialized artifacts. And before next materialization, it will always report the same {{IncrementalRemoteKeyedStateHandle}} as before.
It's fine to register this for the 1st time. However, for the 2nd time to register {{IncrementalRemoteKeyedStateHandle}} (via {{ChangelogStateBackendHandleImpl#registerSharedStates}}), it will discard the private state artifacts without check the register reference:
IncrementalRemoteKeyedStateHandle:
{code:java}
public void discardState() throws Exception {
try {
StateUtil.bestEffortDiscardAllStateObjects(privateState.values());
} catch (Exception e) {
LOG.warn("Could not properly discard misc file states.", e);
}
}
{code}
Thus, this would delete the private state (such as RocksDB's MAINFEST), and once restore, job would not report FileNotFoundException.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)