You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "hbgstc123 (via GitHub)" <gi...@apache.org> on 2023/04/13 14:30:52 UTC

[GitHub] [hudi] hbgstc123 commented on pull request #8443: [HUDI-6068] Improve logic of getOldestInstantToRetainForClustering wh…

hbgstc123 commented on PR #8443:
URL: https://github.com/apache/hudi/pull/8443#issuecomment-1507074766

   > @hbgstc123, which border situation could cause that there maybe a moment when the last clean is complete and the next clean plan not generated, if timeline archive execute at this moment, no replace commit will be retained?
   
   For example in flink pipeline, clean is scheduled and executed in class `Cleanfunction` when function `snapshotState()` is invoked at the beginning of a checkpoint, so after a clean is complete and before the next checkpoint is triggered, there is no inflight clean instant in timeline.  
   And even if last clean operation complete the moment before the next checkpoint begin, the generation of clean plan will take time, the bigger the table the longer it may takes, during this clean planning time, there is no inflight clean instant in timeline neither.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org