Posted to commits@hudi.apache.org by "Rajesh Mahindra (Jira)" <ji...@apache.org> on 2022/02/01 04:31:00 UTC

[jira] [Updated] (HUDI-3343) Metadata Table includes Uncommitted Log Files during Bootstrap

     [ https://issues.apache.org/jira/browse/HUDI-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rajesh Mahindra updated HUDI-3343:
----------------------------------
    Sprint: Hudi-Sprint-Jan-24, Hudi-Sprint-Jan-31  (was: Hudi-Sprint-Jan-24)

> Metadata Table includes Uncommitted Log Files during Bootstrap
> --------------------------------------------------------------
>
>                 Key: HUDI-3343
>                 URL: https://issues.apache.org/jira/browse/HUDI-3343
>             Project: Apache Hudi
>          Issue Type: Bug
>          Components: metadata
>            Reporter: Alexey Kudinkin
>            Assignee: Alexey Kudinkin
>            Priority: Blocker
>             Fix For: 0.11.0
>
>
> While working on a fix for HUDI-3322, I discovered the following issue:
> If we bootstrap the MT while a Rollback operation is pending (this could happen when the previous writer had the MT *disabled* when writing the data), then because bootstrapping is done _after_ the Rollback is executed (with its side effects already reflected on the FS), bootstrapping would incorrectly include intermediary files created by the Rollback (such as log files created with a Rollback Command Block appended).
>  
> Filtering of the files is performed here: 
> https://github.com/apache/hudi/blob/master/hudi-client/hudi-client-common/src/main/java/org/apache/hudi/metadata/HoodieBackedTableMetadataWriter.java#L752
>  
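
For illustration only, here is a minimal sketch of the kind of filtering the description refers to: when building the initial file listing, keep only files written by instants that have completed. This is not the code at the link above. The class CommittedFileFilter, the ListedFile type, and the committedPaths method are hypothetical names, and the sketch assumes each listed file can be attributed to the single instant that wrote it.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.stream.Collectors;

public class CommittedFileFilter {

    /** Hypothetical stand-in for a file found on storage, tagged with the instant that wrote it. */
    static final class ListedFile {
        final String path;
        final String writtenByInstant;

        ListedFile(String path, String writtenByInstant) {
            this.path = path;
            this.writtenByInstant = writtenByInstant;
        }
    }

    /**
     * Returns the paths of files whose writing instant is in the set of completed instants.
     * Files written by pending or in-flight operations (for example an unfinished rollback)
     * are dropped from the listing.
     */
    static List<String> committedPaths(List<ListedFile> listing, Set<String> completedInstants) {
        return listing.stream()
                .filter(f -> completedInstants.contains(f.writtenByInstant))
                .map(f -> f.path)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Assumed example data: instant "001" is a completed commit,
        // instant "002" is a rollback that has not completed yet.
        List<ListedFile> listing = Arrays.asList(
                new ListedFile("p1/base-file-a", "001"),
                new ListedFile("p1/log-file-b", "002"));
        Set<String> completed = new HashSet<>(Arrays.asList("001"));

        // Only the file written by the completed instant is kept.
        System.out.println(committedPaths(listing, completed));
    }
}
```

The point of the sketch is only that the listing is checked against completed instants rather than against whatever is currently present on the FS. How the actual writer attributes a log file to an instant is not shown here.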



--
This message was sent by Atlassian Jira
(v8.20.1#820001)