You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Denys Kuzmenko (Jira)" <ji...@apache.org> on 2022/12/12 08:25:00 UTC
[jira] (HIVE-24291) Compaction Cleaner prematurely cleans up deltas
[ https://issues.apache.org/jira/browse/HIVE-24291 ]
Denys Kuzmenko deleted comment on HIVE-24291:
---------------------------------------
was (Author: dkuzmenko):
Merged to master.
[~kkasa], thank you for the review!
> Compaction Cleaner prematurely cleans up deltas
> -----------------------------------------------
>
> Key: HIVE-24291
> URL: https://issues.apache.org/jira/browse/HIVE-24291
> Project: Hive
> Issue Type: Bug
> Reporter: Peter Varga
> Assignee: Peter Varga
> Priority: Major
> Labels: pull-request-available
> Fix For: 4.0.0-alpha-1
>
> Time Spent: 2h 10m
> Remaining Estimate: 0h
>
> Since HIVE-23107 the cleaner can clean up deltas that are still used by running queries.
> Example:
> * TxnId 1-5 writes to a partition, all commits
> * Compactor starts with txnId=6
> * Long running query starts with txnId=7, it sees txnId=6 as open in its snapshot
> * Compaction commits
> * Cleaner runs
> Previously min_history_level table would have prevented the Cleaner to delete the deltas1-5 until txnId=7 is open, but now they will be deleted and the long running query may fail if its tries to access the files.
> Solution could be to not run the cleaner until any txn is open that was opened before the compaction was committed (CQ_NEXT_TXN_ID)
--
This message was sent by Atlassian Jira
(v8.20.10#820010)