You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "sivabalan narayanan (Jira)" <ji...@apache.org> on 2022/02/08 14:50:00 UTC
[jira] [Created] (HUDI-3391) presto and hive beeline fails to read MOR table w/ 2 or more array fields
sivabalan narayanan created HUDI-3391:
-----------------------------------------
Summary: presto and hive beeline fails to read MOR table w/ 2 or more array fields
Key: HUDI-3391
URL: https://issues.apache.org/jira/browse/HUDI-3391
Project: Apache Hudi
Issue Type: Task
Components: reader-core
Reporter: sivabalan narayanan
We have an issue reported by user [here|[https://github.com/apache/hudi/issues/2657].] Looks like w/ 0.10.0 or later, spark datasource read works, but hive beeline does not work. Even spark.sql (hive table) querying works as well.
Another related ticket: [https://github.com/apache/hudi/issues/3834#issuecomment-997307677]
Steps that I tried:
[https://gist.github.com/nsivabalan/fdb8794104181f93b9268380c7f7f079]
From beeline, you will encounter below exception
{code:java}
Failed with exception java.io.IOException:org.apache.hudi.org.apache.avro.SchemaParseException: Can't redefine: array {code}
All linked ticket states that upgrading parquet to 1.11.0 or greater should work. We need to try it out w/ latest master and go from there.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)