You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "ehurheap (via GitHub)" <gi...@apache.org> on 2023/03/20 16:15:58 UTC

[GitHub] [hudi] ehurheap commented on issue #8209: [SUPPORT] auto_clean stopped running during ingest

ehurheap commented on issue #8209:
URL: https://github.com/apache/hudi/issues/8209#issuecomment-1476537258

   Prior to the earliest `deltacommit`, we have 23 filegroups with cleans inflight: for example:
   ```
   20221119132417991.clean
   20221119132417991.clean.inflight
   20221119132417991.clean.requested
   ```
   
   Then the earliest `deltacommit` is:
   ```
   20221208164706388.deltacommit
   20221208164706388.deltacommit.inflight
   20221208164706388.deltacommit.requested
   20221208164706388.savepoint
   20221208164706388.savepoint.inflight
   ```
   Following that filegroup there are a few `replacecommit`s:
   ```
   20221208174357204.replacecommit
   20221208174357204.replacecommit.inflight
   20221208174357204.replacecommit.requested
   ```
   etc. Followed by more `deltacommits`.
   
   
   - Are these in conflict somehow?
   - How can we tell what has been archived?
   - Is it possible that the archive/cleaner configs can be in conflict with each other?
   
   (This is from looking at the files in the S3 console - trying to use the hudi-cli commands is very slow, as it can take an hour to list commits for this table.)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org