You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Rajesh Balamohan (Jira)" <ji...@apache.org> on 2023/01/31 06:43:00 UTC
[jira] [Created] (HIVE-27005) Iceberg: Col stats are not used in queries
Rajesh Balamohan created HIVE-27005:
---------------------------------------
Summary: Iceberg: Col stats are not used in queries
Key: HIVE-27005
URL: https://issues.apache.org/jira/browse/HIVE-27005
Project: Hive
Issue Type: Improvement
Components: Iceberg integration
Reporter: Rajesh Balamohan
Attachments: col_stats.txt
1. Though, insert-queries compute colstats during runtime, they are not persisted in HMS during final call.
2. Due to #1, col stats are not available during runtime for hive queries. This includes col stats, NDV etc. So unless users explicitly run "analyse table" statements, queries can be have suboptimal plans.
E.g [col_stats.txt{^}!https://jira.cloudera.com/images/icons/link_attachment_7.gif|width=7,height=7!{^}|https://jira.cloudera.com/secure/attachment/658390/658390_col_stats.txt](note that there is no col stats being used)
--
This message was sent by Atlassian Jira
(v8.20.10#820010)