You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Raymond Xu (Jira)" <ji...@apache.org> on 2022/09/07 15:28:00 UTC

[jira] [Updated] (HUDI-4626) Partitioning table by `_hoodie_partition_path` fails

     [ https://issues.apache.org/jira/browse/HUDI-4626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Raymond Xu updated HUDI-4626:
-----------------------------
    Sprint: 2022/08/22, 2022/09/05  (was: 2022/08/22)

> Partitioning table by `_hoodie_partition_path` fails
> ----------------------------------------------------
>
>                 Key: HUDI-4626
>                 URL: https://issues.apache.org/jira/browse/HUDI-4626
>             Project: Apache Hudi
>          Issue Type: Bug
>    Affects Versions: 0.12.0
>            Reporter: Alexey Kudinkin
>            Assignee: Alexey Kudinkin
>            Priority: Blocker
>             Fix For: 0.12.1
>
>
>  
> Currently, creating a table partitioned by "_hoodie_partition_path" using Glue catalog fails w/ the following exception:
> {code:java}
> AnalysisException: Found duplicate column(s) in the data schema and the partition schema: _hoodie_partition_path
> {code}
> Using following DDL:
> {code:java}
> CREATE EXTERNAL TABLE `active_storage_attachments`(  `_hoodie_commit_time` string COMMENT '',   `_hoodie_commit_seqno` string COMMENT '',   `_hoodie_record_key` string COMMENT '',   `_hoodie_file_name` string COMMENT '',   `_change_operation_type` string COMMENT '',   `_upstream_event_processed_ts_ms` bigint COMMENT '',   `db_shard_source_partition` string COMMENT '',   `_event_origin_ts_ms` bigint COMMENT '',   `_event_tx_id` bigint COMMENT '',   `_event_lsn` bigint COMMENT '',   `_event_xmin` bigint COMMENT '',   `id` bigint COMMENT '',   `name` string COMMENT '',   `record_type` string COMMENT '',   `record_id` bigint COMMENT '',   `blob_id` bigint COMMENT '',   `created_at` timestamp COMMENT '')PARTITIONED BY (   `_hoodie_partition_path` string COMMENT '')ROW FORMAT SERDE   'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe' WITH SERDEPROPERTIES (   'hoodie.query.as.ro.table'='false',   'path'='...') STORED AS INPUTFORMAT   'org.apache.hudi.hadoop.HoodieParquetInputFormat' OUTPUTFORMAT   'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'LOCATION  '...'
> TBLPROPERTIES (  'spark.sql.sources.provider'='hudi' )
>  {code}
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)