You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "satish (Jira)" <ji...@apache.org> on 2020/10/06 02:40:00 UTC

[jira] [Resolved] (HUDI-1072) Reader changes to support clustering and insert overwrite

     [ https://issues.apache.org/jira/browse/HUDI-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

satish resolved HUDI-1072.
--------------------------
    Resolution: Fixed

> Reader changes to support clustering and insert overwrite
> ---------------------------------------------------------
>
>                 Key: HUDI-1072
>                 URL: https://issues.apache.org/jira/browse/HUDI-1072
>             Project: Apache Hudi
>          Issue Type: Sub-task
>            Reporter: satish
>            Assignee: satish
>            Priority: Major
>              Labels: pull-request-available
>
> * Add metadata to track ‘replaced’ files. Replaced files are essentially file groups to be ignored. For ‘insert overwrite’ this is all existing files in the partition overwritten. For ‘clustering’, this is all file groups that are merged into a new set of file groups.
> * Change Views to ignore replaced files (AbstractTableFileSystemView and all subclasses)
> * Change cleaner to delete data files that have been replaced (Introduce a new policy?)
> * Change archival to not delete active commits that have this special metadata if corresponding data files are not deleted.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)