You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Ádám Szita (Jira)" <ji...@apache.org> on 2020/02/26 09:18:00 UTC
[jira] [Updated] (HIVE-22931) HoS dynamic partitioning fails with
blobstore optimizations off
[ https://issues.apache.org/jira/browse/HIVE-22931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ádám Szita updated HIVE-22931:
------------------------------
Description:
Reproduction steps:
- Create s3a backed table and normal table.
{code:java}
CREATE TABLE source (
a string,
b int,
c int);
CREATE TABLE target (
a string)
PARTITIONED BY (
b int,
c int)
STORED AS parquet
LOCATION
's3a://somepath';
{code}
- Insert values into normal table.
{code:java}
INSERT INTO TABLE source VALUES ("a", "1", "1");
{code}
- Do an insert overwrite with dynamic partitions:
{code:java}
set hive.exec.dynamic.partition.mode=nonstrict;
set hive.blobstore.optimizations.enabled=false;
set hive.execution.engine=spark;
INSERT OVERWRITE TABLE target partition (b,c)
SELECT *
FROM source;{code}
This fails only with Spark execution engine + blobstorage optimizations being turned off with:
{code}
2020-01-16 15:24:56,064 ERROR hive.ql.metadata.Hive: [load-dynamic-partitions-5]: Exception when loading partition with parameters partPath=hdfs://nameservice1/tmp/hive/hive/6bcee075-b637-429e-9bf0-a2658355415e/hive_2020-01-16_15-24-01_156_4299941251929377815-4/-mr-10000/.hive-staging_hive_2020-01-16_15-24-01_156_4299941251929377815-4/-ext-10002, table=email_click_base, partSpec={b=null, c=null}, replace=true, listBucketingEnabled=false, isAcid=false, hasFollowingStatsTask=trueorg.apache.hadoop.hive.ql.metadata.HiveException: MetaException(message:Partition spec is incorrect. {companyid=null, eventmonth=null})
at org.apache.hadoop.hive.ql.metadata.Hive.loadPartitionInternal(Hive.java:1666)
{code}
was:
Reproduction steps:
- Create s3a backed table and normal table.
{code:java}
CREATE TABLE source (
a string,
b int,
c int);
CREATE TABLE target (
a string)
PARTITIONED BY (
b int,
c int)
STORED AS parquet
LOCATION
's3a://somepath';
{code}
- Insert values into normal table.
{code:java}
INSERT INTO TABLE source VALUES ("a", "1", "1");
{code}
- Do an insert overwrite with dynamic partitions:
{code:java}
set hive.exec.dynamic.partition.mode=nonstrict;
set hive.blobstore.optimizations.enabled=false;
set hive.execution.engine=spark;
INSERT OVERWRITE TABLE target partition (b,c)
SELECT *
FROM source;{code}
This fails only with Spark execution engine + blobstorage optimizations being turned off with:
2020-01-16 15:24:56,064 ERROR hive.ql.metadata.Hive: [load-dynamic-partitions-5]: Exception when loading partition with parameters partPath=hdfs://nameservice1/tmp/hive/hive/6bcee075-b637-429e-9bf0-a2658355415e/hive_2020-01-16_15-24-01_156_4299941251929377815-4/-mr-10000/.hive-staging_hive_2020-01-16_15-24-01_156_4299941251929377815-4/-ext-10002, table=email_click_base, partSpec={b=null, c=null}, replace=true, listBucketingEnabled=false, isAcid=false, hasFollowingStatsTask=trueorg.apache.hadoop.hive.ql.metadata.HiveException: MetaException(message:Partition spec is incorrect. {companyid=null, eventmonth=null})
at org.apache.hadoop.hive.ql.metadata.Hive.loadPartitionInternal(Hive.java:1666)
> HoS dynamic partitioning fails with blobstore optimizations off
> ---------------------------------------------------------------
>
> Key: HIVE-22931
> URL: https://issues.apache.org/jira/browse/HIVE-22931
> Project: Hive
> Issue Type: Bug
> Reporter: Ádám Szita
> Assignee: Ádám Szita
> Priority: Major
>
> Reproduction steps:
> - Create s3a backed table and normal table.
> {code:java}
> CREATE TABLE source (
> a string,
> b int,
> c int);
>
> CREATE TABLE target (
> a string)
> PARTITIONED BY (
> b int,
> c int)
> STORED AS parquet
> LOCATION
> 's3a://somepath';
> {code}
> - Insert values into normal table.
> {code:java}
> INSERT INTO TABLE source VALUES ("a", "1", "1");
> {code}
> - Do an insert overwrite with dynamic partitions:
> {code:java}
> set hive.exec.dynamic.partition.mode=nonstrict;
> set hive.blobstore.optimizations.enabled=false;
> set hive.execution.engine=spark;
> INSERT OVERWRITE TABLE target partition (b,c)
> SELECT *
> FROM source;{code}
> This fails only with Spark execution engine + blobstorage optimizations being turned off with:
> {code}
> 2020-01-16 15:24:56,064 ERROR hive.ql.metadata.Hive: [load-dynamic-partitions-5]: Exception when loading partition with parameters partPath=hdfs://nameservice1/tmp/hive/hive/6bcee075-b637-429e-9bf0-a2658355415e/hive_2020-01-16_15-24-01_156_4299941251929377815-4/-mr-10000/.hive-staging_hive_2020-01-16_15-24-01_156_4299941251929377815-4/-ext-10002, table=email_click_base, partSpec={b=null, c=null}, replace=true, listBucketingEnabled=false, isAcid=false, hasFollowingStatsTask=trueorg.apache.hadoop.hive.ql.metadata.HiveException: MetaException(message:Partition spec is incorrect. {companyid=null, eventmonth=null})
> at org.apache.hadoop.hive.ql.metadata.Hive.loadPartitionInternal(Hive.java:1666)
> {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)