Posted to commits@impala.apache.org by cs...@apache.org on 2022/08/30 10:57:04 UTC

[impala] 02/02: IMPALA-11483: [DOCS] Document EXPIRE SNAPSHOTS functionality

This is an automated email from the ASF dual-hosted git repository.

csringhofer pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/impala.git

commit 876829951dc23ac044e79c3ed3088207d812d3bb
Author: Tamas Mate <tm...@cloudera.com>
AuthorDate: Thu Aug 11 16:43:44 2022 +0200

    IMPALA-11483: [DOCS] Document EXPIRE SNAPSHOTS functionality
    
    This commit documents the Iceberg expire snapshot functionality.
    
    Testing:
     - Built the docs locally and verified the generated html.
    
    Change-Id: Iddd927dd1b6ee3b99dd29982018a610389e64836
    Reviewed-on: http://gerrit.cloudera.org:8080/18838
    Reviewed-by: Tamas Mate <tm...@apache.org>
    Tested-by: Tamas Mate <tm...@apache.org>
---
 docs/topics/impala_iceberg.xml | 32 ++++++++++++++++++++++++++++++++
 1 file changed, 32 insertions(+)

diff --git a/docs/topics/impala_iceberg.xml b/docs/topics/impala_iceberg.xml
index 78c865832..1ae133623 100644
--- a/docs/topics/impala_iceberg.xml
+++ b/docs/topics/impala_iceberg.xml
@@ -506,6 +506,38 @@ CREATE TABLE new_ice_tbl LIKE orig_ice_tbl;
     </conbody>
   </concept>
 
+  <concept id="iceberg_expire_snapshots">
+    <title>Expiring snapshots</title>
+    <conbody>
+      <p>
+        Iceberg snapshots accumulate until they are deleted by a user action. Snapshots
+        can be deleted with the <codeph>ALTER TABLE ... EXECUTE expire_snapshots(...)</codeph>
+        statement, which expires snapshots that are older than the specified
+        timestamp. For example:
+        <codeblock>
+ALTER TABLE ice_tbl EXECUTE expire_snapshots('2022-01-04 10:00:00');
+ALTER TABLE ice_tbl EXECUTE expire_snapshots(now() - interval 5 days);
+        </codeblock>
+      </p>
+      <p>
+        The <codeph>expire_snapshots</codeph> operation:
+        <ul>
+          <li>does not remove old metadata files by default.</li>
+          <li>does not remove orphaned data files.</li>
+          <li>respects the minimum number of snapshots to keep, which is set by the
+          <codeph>history.expire.min-snapshots-to-keep</codeph> table property.</li>
+        </ul>
+      </p>
+      <p>
+        Cleanup of old metadata files can be configured with the
+        <codeph>write.metadata.delete-after-commit.enabled=true</codeph> and
+        <codeph>write.metadata.previous-versions-max</codeph> table properties. These
+        enable automatic removal of old metadata files after operations that modify
+        metadata, such as expiring snapshots or inserting data.
+      </p>
+    </conbody>
+  </concept>
+
   <concept id="iceberg_table_properties">
     <title>Iceberg table properties</title>
     <conbody>