You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Udit Mehrotra (Jira)" <ji...@apache.org> on 2021/08/25 08:51:00 UTC
[jira] [Updated] (HUDI-907) Test Presto mor query support changes
in HDFS Env
[ https://issues.apache.org/jira/browse/HUDI-907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Udit Mehrotra updated HUDI-907:
-------------------------------
Fix Version/s: (was: 0.9.0)
0.10.0
> Test Presto mor query support changes in HDFS Env
> -------------------------------------------------
>
> Key: HUDI-907
> URL: https://issues.apache.org/jira/browse/HUDI-907
> Project: Apache Hudi
> Issue Type: Sub-task
> Components: Presto Integration
> Affects Versions: 0.9.0
> Reporter: Bhavani Sudha
> Assignee: Bhavani Sudha
> Priority: Major
> Fix For: 0.10.0
>
>
> Test presto integration for HDFS environment as well in addition to S3.
>
> Blockers faced so far
> [~bdscheller] I tried to apply your presto patch to test mor queries on Presto. The way I set it up was create a docker image from your presto patch and use that image in hudi local docker environment. I observed couple of issues there:
> * I got NoClassDefFoundError for these classes:
> ** org/apache/parquet/avro/AvroSchemaConverter
> ** org/apache/parquet/hadoop/ParquetFileReader
> ** org/apache/parquet/io/InputFile
> ** org/apache/parquet/format/TypeDefinedOrder
> I was able to get around the first three errors by shading org.apache.parquet inside hudi-presto-bundle and changing presto-hive to depend on the hudi-presto-bundle. However, for the last one shading dint help because its already a Thrift generated class. I am wondering you also ran into similar issues while testing S3.
> Could you please elaborate your test set up so we can do similar thing for HDFS as well. If we need to add more changes to hudi-presto-bundle, we would need to prioritize that for 0.5.3 release asap.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)