
Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2019/04/15 03:12:43 UTC

[GitHub] [spark] wangyum opened a new pull request #24371: [SPARK-27176][FOLLOW-UP][SQL] Shade Parquet to sql/core

wangyum opened a new pull request #24371: [SPARK-27176][FOLLOW-UP][SQL] Shade Parquet to sql/core
URL: https://github.com/apache/spark/pull/24371
 
 
   ## What changes were proposed in this pull request?
   
   When we compile and test with Hadoop 3.2, we hit the following two issues:
   1. `JobSummaryLevel is not a member of object org.apache.parquet.hadoop.ParquetOutputFormat`. Fixed by [PARQUET-381](https://issues.apache.org/jira/browse/PARQUET-381) (Parquet 1.9.0).
   2. `java.lang.NoSuchFieldError: BROTLI at org.apache.parquet.hadoop.metadata.CompressionCodecName.<clinit>(CompressionCodecName.java:31)`. Fixed by [PARQUET-1143](https://issues.apache.org/jira/browse/PARQUET-1143) (Parquet 1.10.0).
   
   The cause is that `parquet-hadoop-bundle-1.8.1.jar` conflicts with Parquet 1.10.1. This PR shades Parquet 1.10.1 into `sql/core` to work around the issue. After this PR, the Parquet versions should be:
   
   Hadoop profile | sql/core | sql/hive
   -- | -- | --
   hadoop-2.7 | 1.10.1 | 1.6.0
   hadoop-3.2 | 1.10.1 | 1.8.1
   
   To keep things simple, this PR also removes `parquet-provided` support.
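
When two Parquet versions share a classpath, it can help to see which jar actually supplied a class and whether that copy has the expected field. The following is a minimal diagnostic sketch and not part of this PR: the class name `ClasspathProbe` and its two helper methods are hypothetical, and the defaults in `main` point at a JDK class only so that the example runs without Parquet present.

```java
import java.lang.reflect.Modifier;
import java.security.CodeSource;

// Hypothetical helper for diagnosing classpath conflicts; not part of Spark or Parquet.
public class ClasspathProbe {

    /** Returns the jar or directory a class was loaded from, or a placeholder if the JVM does not report one. */
    public static String locationOf(Class<?> cls) {
        CodeSource src = cls.getProtectionDomain().getCodeSource();
        if (src == null || src.getLocation() == null) {
            // Classes loaded by the bootstrap loader typically have no code source.
            return "(bootstrap or unknown)";
        }
        return src.getLocation().toString();
    }

    /** Returns true if the class exposes a public static field (for example an enum constant) with this name. */
    public static boolean hasStaticField(Class<?> cls, String fieldName) {
        try {
            return Modifier.isStatic(cls.getField(fieldName).getModifiers());
        } catch (NoSuchFieldException e) {
            return false;
        }
    }

    public static void main(String[] args) throws ClassNotFoundException {
        // Defaults use a JDK enum so the sketch runs anywhere; pass your own class and field as arguments.
        String className = args.length > 0 ? args[0] : "java.util.concurrent.TimeUnit";
        String fieldName = args.length > 1 ? args[1] : "SECONDS";

        // initialize=false loads the class without running its static initializer,
        // so a failing <clinit> does not abort the probe.
        Class<?> cls = Class.forName(className, false, ClasspathProbe.class.getClassLoader());

        System.out.println(className + " loaded from: " + locationOf(cls));
        System.out.println("has static field " + fieldName + ": " + hasStaticField(cls, fieldName));
    }
}
```

Run with the application's classpath, something like `java -cp <classpath> ClasspathProbe org.apache.parquet.hadoop.metadata.CompressionCodecName BROTLI` would show which jar won for that class. If the reported location is the older bundle, that would be consistent with the conflict described above.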
   
   ## How was this patch tested?
   
   manual tests
   
