Posted to commits@hudi.apache.org by "xi chaomin (Jira)" <ji...@apache.org> on 2022/07/26 02:47:00 UTC
[jira] [Commented] (HUDI-4426) The implementation of Clean is inconsistent with CLEANER_COMMITS_RETAINED
[ https://issues.apache.org/jira/browse/HUDI-4426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17571177#comment-17571177 ]
xi chaomin commented on HUDI-4426:
----------------------------------
This behavior is by design; it is not a bug.
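For anyone who hits the same confusion, here is a rough sketch of how I understand the commit-based cleaning rule. It is a simplified model, not Hudi's actual code: the function name, its parameters, and the short timestamps are all hypothetical. The assumption is that, for each file group, the cleaner keeps every file slice written at or after the earliest retained commit, plus the newest slice written before it, so that a reader at the earliest retained commit still finds a complete file group.

```python
def slices_to_clean(slice_commit_times, all_commit_times, commits_retained):
    """Model which file slices of ONE file group a commit-based cleaner deletes.

    Hypothetical helper for illustration only; not part of Hudi.
    Commit times are equal-length strings, so string order is time order.
    """
    commits = sorted(all_commit_times)
    # Assumption: with no more commits than the retention count, nothing is cleaned.
    if len(commits) <= commits_retained:
        return []
    # The oldest commit that must stay queryable.
    earliest_retained = commits[-commits_retained]
    # Slices written strictly before the earliest retained commit.
    older = sorted(t for t in slice_commit_times if t < earliest_retained)
    # Keep the newest of those older slices; everything before it can go.
    return older[:-1]


# One file group rewritten by three commits, with commits retained = 1.
print(slices_to_clean(["001", "002", "003"], ["001", "002", "003"], 1))
```

Under this model, retaining 1 commit still leaves two file slices in a file group that was written by the latest commit: the latest slice and the one just before it.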
> The implementation of Clean is inconsistent with CLEANER_COMMITS_RETAINED
> -------------------------------------------------------------------------
>
> Key: HUDI-4426
> URL: https://issues.apache.org/jira/browse/HUDI-4426
> Project: Apache Hudi
> Issue Type: Improvement
> Components: cleaning
> Reporter: xi chaomin
> Priority: Major
> Labels: pull-request-available
> Attachments: WX20220720-175720.png, WX20220720-175741.png, WX20220720-180156.png
>
>
> The commits and files before cleaning:
> !WX20220720-175741.png!
> !WX20220720-175720.png!
> Files after I ran cleans run --sparkMaster local --hoodieConfigs hoodie.cleaner.commits.retained=1:
> !WX20220720-180156.png!
> Only the latest file slice should be kept, so 90e7e5d5-0ab0-436e-aeee-ec0935007e21-0_1-197-308_20220720175340478.parquet and 3f90caae-bf8e-410c-ad56-f79672227bde-0_1-109-154_20220720175240602.parquet should have been cleaned.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)