
Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2021/02/02 08:54:41 UTC

[GitHub] [spark] dongjoon-hyun commented on a change in pull request #31133: [SPARK-26836][SQL] Supporting Avro schema evolution for partitioned Hive tables

dongjoon-hyun commented on a change in pull request #31133:
URL: https://github.com/apache/spark/pull/31133#discussion_r568424773



##########
File path: sql/hive/src/main/scala/org/apache/spark/sql/hive/TableReader.scala
##########
@@ -388,6 +394,9 @@ private[hive] object HiveTableUtil {
 private[hive] object DeserializerLock
 
 private[hive] object HadoopTableReader extends HiveInspectors with Logging {
+
+  val avroTableProperties = AvroTableProperties.values().map(_.getPropName()).toSet

Review comment:
       SPARK-26836 is filed against versions 2.3.1 through 3.2.0, and this patch depends on `org.apache.hadoop.hive.serde2.avro.AvroSerdeUtils.AvroTableProperties`. I'm wondering whether this patch is applicable to the older Spark branches, which use older Hive libraries. For example, the Hive versions used are:
   - Apache Spark master: 2.3.8
   - Apache Spark branch-3.1: 2.3.7
   - Apache Spark branch-3.0: 2.3.7 and 1.2.1.spark2
   - Apache Spark branch-2.4: 1.2.1.spark2
   
   What is the minimum Hive library version required to use this? Are there any known bugs in the `AvroTableProperties` class?
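
   One way to answer the version question per branch might be a reflective probe run against that branch's Hive classpath. The following is only a minimal sketch: the object name `AvroTablePropertiesProbe` and its `isAvailable` helper are hypothetical names made up for illustration, and only the Hive class name is taken from the patch.

```scala
import scala.util.Try

// Hypothetical helper (not part of Spark or Hive): reports whether a class
// can be found on the current classpath.
object AvroTablePropertiesProbe {
  // Binary name of the nested enum the patch depends on.
  val EnumClassName: String =
    "org.apache.hadoop.hive.serde2.avro.AvroSerdeUtils$AvroTableProperties"

  // Looks the class up without initializing it; a missing class raises
  // ClassNotFoundException, which Try turns into a failure.
  def isAvailable(className: String = EnumClassName): Boolean =
    Try(Class.forName(className, false, getClass.getClassLoader)).isSuccess

  def main(args: Array[String]): Unit =
    println(s"AvroTableProperties available: ${isAvailable()}")
}
```

   Running this with the Hive jars of a given branch on the classpath would show whether the enum exists there. It says nothing about whether the enum's values or behavior differ between Hive versions.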



