You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@bookkeeper.apache.org by "Yiming Zang (JIRA)" <ji...@apache.org> on 2017/04/20 05:46:04 UTC

[jira] [Created] (BOOKKEEPER-1040) Use separate log for compaction

Yiming Zang created BOOKKEEPER-1040:
---------------------------------------

             Summary: Use separate log for compaction
                 Key: BOOKKEEPER-1040
                 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-1040
             Project: Bookkeeper
          Issue Type: Bug
          Components: bookkeeper-server
    Affects Versions: 4.3.2
            Reporter: Yiming Zang
            Assignee: Yiming Zang


Bookkeeper is not able to reclaim disk space when it's full
If all disks are full or almost full, both major and minor compactions would be suspended, and only GC will be running. In the current design, this is the right thing to do, because when disks are full, EntryLogger can not allocate any new entry logs any more, and apart from that,  the intention is to prevent disk usage from keep growing.
However, the problem is if we have a mixed of short-lived ledgers and long-lived ledgers in all entry logs, when disks are full, GC wouldn't be able to delete any entry logs, and if compaction is disabled, bookie can't reclaim any disk space any more by itself.
Compaction might keep generating duplicated data which would cause disk full
Currently, there's no transactional operation for compaction. In the current CompactionScannerFactory, if it fails to flush entry log file, or fails to flush ledgerCache, the data which is already flushed wouldn't be deleted, and the entry log that is being compacted will be retried again for the next time, which would generate duplicated data.
Moreover, if the entry log being compacted has long-lived data and the compaction keeps failing for some reason(e.g. corrupted entry, corrupted index), it would cause the BK disk usage keep growing until the either the entry log can be garbage collected, or disk full.




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)