Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/01/02 15:36:26 UTC

[GitHub] [hudi] nsivabalan commented on pull request #4078: [HUDI-2833] Clean up unused archive files instead of expanding indefinitely.

nsivabalan commented on pull request #4078:
URL: https://github.com/apache/hudi/pull/4078#issuecomment-1003733631


   Thanks for the detailed code flow diagram. I have one question on the flow.
   When a job fails while cleaning up small archive files:
   1. We deserialize the plan and get all candidates.
   2. If not all candidates exist, it means that stage 4 succeeded and stage 5 failed.
   3. Delete all leftover small archive files.
   4. Go on to "Select merge candidates": this refers to a new batch of merging, right? From the diagram it's not very apparent whether we are retrying the same batch or going ahead with a new batch. Can you please confirm my understanding here?
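
   To make the question concrete, here is a rough sketch of steps 1 to 3 as I read them from the diagram. The class and method names are hypothetical and not taken from the PR, plain strings stand in for archive file paths, and the branch where every candidate still exists is not covered by the steps above, so the sketch simply deletes nothing there.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical sketch of the recovery check described in steps 1-3; not the PR's code.
final class ArchiveMergeRecoverySketch {

    /**
     * Given the candidates recorded in the (already deserialized) merge plan and
     * the set of files currently present, return the leftover small archive
     * files that recovery should delete.
     */
    static List<String> leftoversToDelete(List<String> planCandidates, Set<String> existingFiles) {
        // Step 1: walk the plan's candidates and note which ones are still present.
        List<String> stillPresent = new ArrayList<>();
        for (String candidate : planCandidates) {
            if (existingFiles.contains(candidate)) {
                stillPresent.add(candidate);
            }
        }

        // All candidates still exist: deletion never started. The steps above do
        // not say what happens in this case, so this sketch deletes nothing.
        if (stillPresent.size() == planCandidates.size()) {
            return new ArrayList<>();
        }

        // Step 2: some candidates are missing, so the earlier stage finished and
        // the cleanup stage failed part-way through.
        // Step 3: whatever is left over from the plan should be deleted.
        return stillPresent;
    }
}
```

   After the leftovers are removed, my question in step 4 is whether the next call selects a fresh batch of candidates or reuses this plan.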
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org