Posted to issues@hive.apache.org by "Antal Sinkovits (Jira)" <ji...@apache.org> on 2022/03/29 07:24:00 UTC

[jira] [Assigned] (HIVE-25492) Major query-based compaction is skipped if partition is empty

     [ https://issues.apache.org/jira/browse/HIVE-25492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Antal Sinkovits reassigned HIVE-25492:
--------------------------------------

    Assignee: Antal Sinkovits

> Major query-based compaction is skipped if partition is empty
> -------------------------------------------------------------
>
>                 Key: HIVE-25492
>                 URL: https://issues.apache.org/jira/browse/HIVE-25492
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Karen Coppage
>            Assignee: Antal Sinkovits
>            Priority: Major
>
> Currently, if the result of query-based compaction is an empty base, delta, or delete delta, the empty directory is deleted.
> This behavior exists for the sake of minor compaction: if there are only deltas to compact, then no compacted delete delta should be created (only a compacted delta). In the same way, if there are only delete deltas to compact, then no compacted delta should be created (only a compacted delete delta).
> Major compaction is where this becomes a problem. If all the data in the partition has been deleted, compaction should produce an empty base directory. Instead, the base directory is deleted because it is empty, and compaction reports success even though we end up with the same deltas/delete deltas we started with; effectively, compaction does not run.
> Where to start? MajorQueryCompactor#commitCompaction
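
The rule described above can be modelled as a small, self-contained Java sketch. It is illustrative only and is not Hive code: the class, enums, and method names are all hypothetical, and the "proposed" rule is just one reading of the description (keep an empty base produced by major compaction, and keep deleting every other empty result directory). It does not reflect what MajorQueryCompactor#commitCompaction actually does.

```java
// Hypothetical model of the keep/delete decision for a compaction result directory.
// None of these names come from the Hive code base.
class EmptyCompactionResultSketch {

    enum CompactionType { MINOR, MAJOR }

    enum DirKind { BASE, DELTA, DELETE_DELTA }

    // Behavior described in the issue: any empty result directory is deleted,
    // regardless of compaction type or directory kind.
    static boolean keepResultDirCurrent(CompactionType type, DirKind kind, long rowCount) {
        return rowCount > 0;
    }

    // Assumed fix: an empty base written by major compaction is kept, because it
    // records that all earlier deltas and delete deltas have been compacted away.
    // Empty deltas and delete deltas are still deleted, as minor compaction requires.
    static boolean keepResultDirProposed(CompactionType type, DirKind kind, long rowCount) {
        if (type == CompactionType.MAJOR && kind == DirKind.BASE) {
            return true;
        }
        return rowCount > 0;
    }

    public static void main(String[] args) {
        // Partition in which every row has been deleted: major compaction writes an empty base.
        System.out.println("current keeps empty base:  "
                + keepResultDirCurrent(CompactionType.MAJOR, DirKind.BASE, 0));
        System.out.println("proposed keeps empty base: "
                + keepResultDirProposed(CompactionType.MAJOR, DirKind.BASE, 0));
    }
}
```

In this model, the minor-compaction cases behave identically under both rules; only the empty-base case for major compaction changes.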



--
This message was sent by Atlassian Jira
(v8.20.1#820001)