You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/09/28 08:52:49 UTC

[GitHub] [hudi] xushiyan commented on pull request #5052: [HUDI-3644] hoodie log scan bug cause data duplication bugfix

xushiyan commented on PR #5052:
URL: https://github.com/apache/hudi/pull/5052#issuecomment-1260593200

   > im explained in jira :https://issues.apache.org/jira/browse/HUDI-3644
   > 
   > when log file block append sucess. And deltacommit rollback. And this instance time is before activeTimeline starts. This log file block will be merged, cause data duplication.
   
   So the situation is: there is a deltacommit and a rollback commit, then archive happened and deltacommit archived. As @yihua mentioned `ROLLBACK_PREVIOUS_BLOCK` is to prevent merging invalid delta log data. So will proceed to close this.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org