You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/03/02 21:39:00 UTC

[jira] [Work logged] (HIVE-26000) DirectSQL to pruning partitions fails with postgres backend for Skewed-Partition tables

     [ https://issues.apache.org/jira/browse/HIVE-26000?focusedWorklogId=735609&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-735609 ]

ASF GitHub Bot logged work on HIVE-26000:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 02/Mar/22 21:38
            Start Date: 02/Mar/22 21:38
    Worklog Time Spent: 10m 
      Work Description: nareshpr opened a new pull request #3073:
URL: https://github.com/apache/hive/pull/3073


   ### What changes were proposed in this pull request?
   PartitionPruning via directSql is failing in postgres db for skewed tables
   
   ### Why are the changes needed?
   Fallback to ORM is taking long time
   
   ### Does this PR introduce _any_ user-facing change?
   No
   
   ### How was this patch tested?
   We already have a testcase covering the issue (list_bucket_dml_4.q), it happens in postgres backend db.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscribe@hive.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

            Worklog Id:     (was: 735609)
    Remaining Estimate: 0h
            Time Spent: 10m

> DirectSQL to pruning partitions fails with postgres backend for Skewed-Partition tables
> ---------------------------------------------------------------------------------------
>
>                 Key: HIVE-26000
>                 URL: https://issues.apache.org/jira/browse/HIVE-26000
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Naresh P R
>            Assignee: Naresh P R
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
>  
>  
> {code:java}
> 2022-03-02 20:37:56,421 INFO  org.apache.hadoop.hive.metastore.PartFilterExprUtil: [pool-6-thread-200]: Unable to make the expression tree from expression string [((ds = '2008-04-08') and (UDFToDouble(hr) = 11.0D))]Error parsing partition filter; lexer error: null; exception NoViableAltException(24@[])
> 2022-03-02 20:37:56,593 WARN  org.apache.hadoop.hive.metastore.ObjectStore: [pool-6-thread-200]: Falling back to ORM path due to direct SQL failure (this is not an error): Error executing SQL query "select "SKEWED_COL_VALUE_LOC_MAP"."SD_ID", "SKEWED_STRING_LIST_VALUES".STRING_LIST_ID, "SKEWED_COL_VALUE_LOC_MAP"."LOCATION", "SKEWED_STRING_LIST_VALUES"."STRING_LIST_VALUE" from "SKEWED_COL_VALUE_LOC_MAP"  left outer join "SKEWED_STRING_LIST_VALUES" on "SKEWED_COL_VALUE_LOC_MAP"."STRING_LIST_ID_KID" = "SKEWED_STRING_LIST_VALUES"."STRING_LIST_ID" where "SKEWED_COL_VALUE_LOC_MAP"."SD_ID" in (51010)  and "SKEWED_COL_VALUE_LOC_MAP"."STRING_LIST_ID_KID" is not null order by "SKEWED_COL_VALUE_LOC_MAP"."SD_ID" asc,  "SKEWED_STRING_LIST_VALUES"."STRING_LIST_ID" asc,  "SKEWED_STRING_LIST_VALUES"."INTEGER_IDX" asc". at org.datanucleus.api.jdo.NucleusJDOHelper.getJDOExceptionForNucleusException(NucleusJDOHelper.java:543) at org.datanucleus.api.jdo.JDOQuery.executeInternal(JDOQuery.java:391) at org.datanucleus.api.jdo.JDOQuery.execute(JDOQuery.java:216) at org.apache.hadoop.hive.metastore.MetastoreDirectSqlUtils.loopJoinOrderedResult(MetastoreDirectSqlUtils.java:131) at org.apache.hadoop.hive.metastore.MetastoreDirectSqlUtils.loopJoinOrderedResult(MetastoreDirectSqlUtils.java:109) at org.apache.hadoop.hive.metastore.MetastoreDirectSqlUtils.setSkewedColLocationMaps(MetastoreDirectSqlUtils.java:414) at org.apache.hadoop.hive.metastore.MetaStoreDirectSql.getPartitionsFromPartitionIds(MetaStoreDirectSql.java:967) at org.apache.hadoop.hive.metastore.MetaStoreDirectSql.getPartitionsFromPartitionIds(MetaStoreDirectSql.java:788) at org.apache.hadoop.hive.metastore.MetaStoreDirectSql.access$300(MetaStoreDirectSql.java:117) at org.apache.hadoop.hive.metastore.MetaStoreDirectSql$1.run(MetaStoreDirectSql.java:530) at org.apache.hadoop.hive.metastore.Batchable.runBatched(Batchable.java:73) at org.apache.hadoop.hive.metastore.MetaStoreDirectSql.getPartitionsViaSqlFilter(MetaStoreDirectSql.java:521) at org.apache.hadoop.hive.metastore.ObjectStore$10.getSqlResult(ObjectStore.java:3722);
>  Caused by: ERROR: column SKEWED_STRING_LIST_VALUES.string_list_id does not exist
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)