Posted to dev@hive.apache.org by "Rajesh Balamohan (Jira)" <ji...@apache.org> on 2023/01/31 06:43:00 UTC

[jira] [Created] (HIVE-27005) Iceberg: Col stats are not used in queries

Rajesh Balamohan created HIVE-27005:
---------------------------------------

             Summary: Iceberg: Col stats are not used in queries
                 Key: HIVE-27005
                 URL: https://issues.apache.org/jira/browse/HIVE-27005
             Project: Hive
          Issue Type: Improvement
          Components: Iceberg integration
            Reporter: Rajesh Balamohan
         Attachments: col_stats.txt

1. Although insert queries compute col stats at runtime, the stats are not persisted in HMS during the final call.

2. Due to #1, col stats (including NDV etc.) are not available at runtime for Hive queries. So unless users explicitly run "ANALYZE TABLE" statements, queries can end up with suboptimal plans.

E.g. col_stats.txt (https://jira.cloudera.com/secure/attachment/658390/658390_col_stats.txt); note that no col stats are being used.
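
Until this is fixed, the explicit workaround mentioned in #2 could be scripted. The following is only a minimal sketch: the helper name analyze_statements and the table/column names are hypothetical and not part of this issue; the generated text follows Hive's "ANALYZE TABLE ... COMPUTE STATISTICS FOR COLUMNS" syntax. Whether that statement behaves as expected on a given Iceberg table would need to be verified separately.

```python
def analyze_statements(tables, columns=None):
    """Build Hive ANALYZE TABLE statements that compute column statistics.

    tables  -- iterable of table names (hypothetical examples below)
    columns -- optional dict mapping a table name to the columns to analyze;
               tables without an entry get stats for all columns
    """
    columns = columns or {}
    statements = []
    for table in tables:
        stmt = f"ANALYZE TABLE {table} COMPUTE STATISTICS FOR COLUMNS"
        cols = columns.get(table)
        if cols:
            # Restrict the stats computation to the listed columns.
            stmt += " " + ", ".join(cols)
        statements.append(stmt)
    return statements


if __name__ == "__main__":
    # Hypothetical table and column names, for illustration only.
    for stmt in analyze_statements(
        ["db.store_sales", "db.item"],
        columns={"db.item": ["i_item_sk", "i_brand"]},
    ):
        print(stmt + ";")
```

The generated statements can then be run through any Hive client (e.g. beeline) after the inserts complete.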



--
This message was sent by Atlassian Jira
(v8.20.10#820010)