Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2019/05/31 02:02:54 UTC

[GitHub] [incubator-hudi] cdmikechen opened a new pull request #707: fix jar problem when using spark-shell

cdmikechen opened a new pull request #707:  fix jar problem when using spark-shell
URL: https://github.com/apache/incubator-hudi/pull/707
 
 
   JIRA link: https://issues.apache.org/jira/browse/HUDI-101
   Issue link: apache#516 (comment)
   
   When saving data with Hoodie from spark-shell, started like this:
   ```
   ./spark-shell --master yarn --jars /home/hdfs/software/spark/hoodie/hoodie-spark-bundle-0.4.8-SNAPSHOT.jar --conf spark.sql.hive.convertMetastoreParquet=false --packages com.databricks:spark-avro_2.11:4.0.0
   ```
   ```
         import com.uber.hoodie.DataSourceWriteOptions
         import com.uber.hoodie.common.model.HoodieTableType
         import com.uber.hoodie.config.HoodieWriteConfig
         import org.apache.spark.sql.SaveMode

         // inputDF is an existing DataFrame holding the records to write
         inputDF.write.format("com.uber.hoodie")
           .option("hoodie.insert.shuffle.parallelism", "1")
           .option("hoodie.upsert.shuffle.parallelism", "1")
           .option(DataSourceWriteOptions.STORAGE_TYPE_OPT_KEY, HoodieTableType.COPY_ON_WRITE.name())
           .option(DataSourceWriteOptions.OPERATION_OPT_KEY, DataSourceWriteOptions.UPSERT_OPERATION_OPT_VAL) // upsert operation
           .option(DataSourceWriteOptions.RECORDKEY_FIELD_OPT_KEY, "_row_key")
           .option(DataSourceWriteOptions.PARTITIONPATH_FIELD_OPT_KEY, "partition")
           .option(DataSourceWriteOptions.PRECOMBINE_FIELD_OPT_KEY, "extend_deal_date")
           .option(HoodieWriteConfig.TABLE_NAME, "c_upload_code")
           .mode(SaveMode.Overwrite)
           .save("/tmp/test/hoodie")
   ```
   It also reports the error `Invalid signature file digest for Manifest main attributes`. All affected dependencies need to be scanned.
   I tested with Hadoop 2.8.5 and Spark 2.3.3. The build was compiled against Hadoop 2.7.3 and Spark 2.3.3 or 2.2.1.
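
   A common cause of `Invalid signature file digest` is that signature files (`META-INF/*.SF`, `*.DSA`, `*.RSA`) from signed dependencies end up inside a bundled jar. Whether that is the exact cause here is an assumption. A minimal sketch for scanning a jar listing for such entries follows; `filter_signature_entries` is a hypothetical helper name, not part of Hoodie:

   ```shell
   # Keep only jar entries that look like signature files directly under META-INF/.
   # Reads entry names on stdin, one per line (for example the output of `jar tf`).
   filter_signature_entries() {
     # `|| true` keeps the exit status at 0 when no signature entries are found.
     grep -E '^META-INF/[^/]+\.(SF|DSA|RSA|EC)$' || true
   }

   # Hypothetical usage against the bundle from the command above:
   #   jar tf hoodie-spark-bundle-0.4.8-SNAPSHOT.jar | filter_signature_entries
   ```

   If this prints any entries, the dependencies they came from are candidates for the scan mentioned above.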
