You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2022/11/07 21:28:00 UTC

[jira] [Commented] (IMPALA-11516) Create better cardinality estimates for Iceberg V2 tables with deletes

    [ https://issues.apache.org/jira/browse/IMPALA-11516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17630035#comment-17630035 ] 

ASF subversion and git services commented on IMPALA-11516:
----------------------------------------------------------

Commit f3504566fb97719eec81771a61785cedc85ba6fa in impala's branch refs/heads/master from LPL
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=f3504566f ]

IMPALA-11681: Set table stats for the Iceberg table by it's partition stats

For the Iceberg tables, table-level statistics such as numRows can be
computed according to iceberg parition stats, which is more accurate and
real-time. Obtaining these statistics is independent of
StatsSetupConst.ROW_COUNT and StatsSetupConst.TOTAL_SIZE in HMS. This is
an improvement for estimating the cardinality of the Iceberg tables.
But now the calculation of V2 Iceberg table is not accurate, maybe after
IMPALA-11516(Return better partition stats for V2 tables) is ready, they
can be considered to replace those MHS statistics.

Testing:
 - Existing tests
 - Test on 'On-demand Metadata' mode
 - For 'select * from
 iceberg_v2_positional_not_all_data_files_have_delete_files where i =
 (select max(i) from iceberg_v2_positional_update_all_rows)', the 'Join
 Order' and 'Distribution Mode' are the same as when table stats are
 present

Change-Id: I3e92d3f25e2a57a64556249410d0af3522598c00
Reviewed-on: http://gerrit.cloudera.org:8080/19168
Reviewed-by: Impala Public Jenkins <im...@cloudera.com>
Tested-by: Impala Public Jenkins <im...@cloudera.com>


> Create better cardinality estimates for Iceberg V2 tables with deletes
> ----------------------------------------------------------------------
>
>                 Key: IMPALA-11516
>                 URL: https://issues.apache.org/jira/browse/IMPALA-11516
>             Project: IMPALA
>          Issue Type: Sub-task
>          Components: Frontend
>            Reporter: Zoltán Borók-Nagy
>            Assignee: Zoltán Borók-Nagy
>            Priority: Major
>              Labels: impala-iceberg
>
> Create better cardinality estimates for Iceberg V2 tables with deletes.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org