Posted to issues@impala.apache.org by "Zoltán Borók-Nagy (Jira)" <ji...@apache.org> on 2022/08/22 15:46:00 UTC

[jira] [Created] (IMPALA-11518) Use Iceberg stats for Table Stats if there's no stats in HMS

Zoltán Borók-Nagy created IMPALA-11518:
------------------------------------------

             Summary: Use Iceberg stats for Table Stats if there's no stats in HMS
                 Key: IMPALA-11518
                 URL: https://issues.apache.org/jira/browse/IMPALA-11518
             Project: IMPALA
          Issue Type: Bug
          Components: Frontend
            Reporter: Zoltán Borók-Nagy


Currently we only use HMS stats, even for Iceberg tables:
https://github.com/apache/impala/blob/14b9fb97b59de1749fd93b0469ff8ea6f030021d/fe/src/main/java/org/apache/impala/catalog/local/LocalTable.java#L163
https://github.com/apache/impala/blob/master/fe/src/main/java/org/apache/impala/catalog/Table.java#L413

IcebergTable and LocalIcebergTable should set 'tableStats_' based on Iceberg metadata.

For tables without delete files we should always use Iceberg stats, because they are precise, whereas HMS stats can be stale.

For tables with delete files the HMS stats might be more precise as they take the deleted rows into account.
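
The selection rule described above could be sketched roughly as follows. This is only an illustration, not Impala's actual code: the class TableStatsChooser, the method chooseNumRows, and the -1 "unknown" sentinel are hypothetical names and conventions assumed for this example, and the fallback to the Iceberg count when delete files exist but HMS has no stats is one reading of the issue summary.

```java
// Hypothetical helper for illustration only; not part of Impala.
final class TableStatsChooser {
  // Assumed sentinel meaning "no row count available".
  static final long UNKNOWN = -1;

  private TableStatsChooser() {}

  /**
   * Picks the row count to store in the table stats.
   *
   * @param hmsNumRows      row count from HMS, or UNKNOWN if HMS has no stats
   * @param icebergNumRows  row count from Iceberg metadata, or UNKNOWN
   * @param hasDeleteFiles  whether the Iceberg table has delete files
   */
  static long chooseNumRows(long hmsNumRows, long icebergNumRows,
      boolean hasDeleteFiles) {
    boolean hmsKnown = hmsNumRows >= 0;
    boolean icebergKnown = icebergNumRows >= 0;

    // Without delete files the Iceberg count is precise, so prefer it
    // even when HMS has (possibly stale) stats.
    if (!hasDeleteFiles && icebergKnown) return icebergNumRows;

    // With delete files, HMS stats may already account for deleted rows.
    if (hmsKnown) return hmsNumRows;

    // No HMS stats: fall back to the Iceberg count if there is one.
    // With delete files present this may overestimate the live rows.
    return icebergKnown ? icebergNumRows : UNKNOWN;
  }
}
```

For example, chooseNumRows(900, 1000, false) returns 1000 (Iceberg wins over possibly stale HMS stats), chooseNumRows(900, 1000, true) returns 900 (HMS preferred when delete files exist), and chooseNumRows(-1, 1000, true) returns 1000 (no HMS stats, so the Iceberg count is used).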


