You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Stamatis Zampetakis (Jira)" <ji...@apache.org> on 2022/10/21 07:21:01 UTC

[jira] [Updated] (HIVE-25867) Partition filter condition should pushed down to metastore query if it is equivalence Predicate

     [ https://issues.apache.org/jira/browse/HIVE-25867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Stamatis Zampetakis updated HIVE-25867:
---------------------------------------
    Fix Version/s:     (was: 4.0.0)

I cleared the fixVersion field since this ticket is still open. Please review this ticket and if the fix is already committed to a specific version please set the version accordingly and mark the ticket as RESOLVED.

According to the [JIRA guidelines|https://cwiki.apache.org/confluence/display/Hive/HowToContribute] the fixVersion should be set only when the issue is resolved/closed.

> Partition filter condition should pushed down to metastore query if it is equivalence Predicate
> -----------------------------------------------------------------------------------------------
>
>                 Key: HIVE-25867
>                 URL: https://issues.apache.org/jira/browse/HIVE-25867
>             Project: Hive
>          Issue Type: Improvement
>          Components: Standalone Metastore
>            Reporter: shezm
>            Assignee: shezm
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 2.5h
>  Remaining Estimate: 0h
>
> The colnum type of the partition is different from the column type of the hql query, the metastore will not push down the query to the RDBMS, but will instead get all PARTITIONS.PART_NAME of the hive table then filter it according to the hql Expression. 
> https://github.com/apache/hive/blob/5b112aa6dcc4e374c0a7c2b24042f24ae6815da1/standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/MetaStoreDirectSql.java#L1316
> If the hive table has too many partitions and there are multiple hql queries at the same time,RDBMS will increasing CPU IO_WAIT and affect performance.
> If the partition filter condition in hql is an equivalent predicate, the metastore should be pushed down to RDBMS, which can optimize the query performance of hive large tables.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)