You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Jiabao Sun (Jira)" <ji...@apache.org> on 2023/02/14 10:49:00 UTC

[jira] [Created] (FLINK-31063) Prevent duplicate reading when restoring from a checkpoint.

Jiabao Sun created FLINK-31063:
----------------------------------

             Summary: Prevent duplicate reading when restoring from a checkpoint.
                 Key: FLINK-31063
                 URL: https://issues.apache.org/jira/browse/FLINK-31063
             Project: Flink
          Issue Type: Bug
          Components: Connectors / MongoDB
    Affects Versions: mongodb-1.0.0
            Reporter: Jiabao Sun
             Fix For: mongodb-1.0.0


Exact-once semantics may not be guaranteed at present on partial reads.
We use a number fetchSize to limit the records count for every fetch loop but we didn't record the offset into the split state. When resuming the split reader from a partially completed split, we may re-read some data.

We should record the current reading offset into split state.
Skip this offset when restoring to prevent duplicate reading.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)