You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "xi chaomin (Jira)" <ji...@apache.org> on 2022/07/26 02:47:00 UTC

[jira] [Commented] (HUDI-4426) The implementation of Clean is inconsistent with CLEANER_COMMITS_RETAINED

    [ https://issues.apache.org/jira/browse/HUDI-4426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17571177#comment-17571177 ] 

xi chaomin commented on HUDI-4426:
----------------------------------

The behavior is designed, not a problem.

> The implementation of Clean is inconsistent with CLEANER_COMMITS_RETAINED
> -------------------------------------------------------------------------
>
>                 Key: HUDI-4426
>                 URL: https://issues.apache.org/jira/browse/HUDI-4426
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: cleaning
>            Reporter: xi chaomin
>            Priority: Major
>              Labels: pull-request-available
>         Attachments: WX20220720-175720.png, WX20220720-175741.png, WX20220720-180156.png
>
>
> The commits and files before clean
>  !WX20220720-175741.png! 
>  !WX20220720-175720.png! 
> Files after I run cleans run --sparkMaster local --hoodieConfigs hoodie.cleaner.commits.retained=1
>  !WX20220720-180156.png! 
> We should keep the latest one file slice,  90e7e5d5-0ab0-436e-aeee-ec0935007e21-0_1-197-308_20220720175340478.parquet and 3f90caae-bf8e-410c-ad56-f79672227bde-0_1-109-154_20220720175240602.parquet should be cleaned.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)