You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/04/18 12:01:43 UTC

[GitHub] [hudi] Aload commented on issue #4825: [SUPPORT] flink hudi some class not found

Aload commented on issue #4825:
URL: https://github.com/apache/hudi/issues/4825#issuecomment-1101345557

   hi , When I use Flink to consume Kakfa and write hudi, I configure the hive synchronization operation.  The same problem occurred during startup .
   When I looked through the hivesyncContext.java source code, I noticed that the package imported under the source code was very strange.  But this is normal when I don't synchronize hive.
   
   
   
   version: hudi 0.10.1  flink 1.13.1  scala 2.12.10
   eg:
   `  private val sinkSql: String = SinkSql.apply(sinkTableName, dataType.getLogicalType.asInstanceOf[RowType])
       .option(FlinkOptions.PATH, hoodiePath.concat(sinkDbName).concat("/").concat(sinkTableName).concat("/"))
       .option(FlinkOptions.PRECOMBINE_FIELD, "receiveTime")
       .option(FlinkOptions.RECORD_KEY_FIELD, "tenantId,pointId,sn,collectTime")
       .option(FlinkOptions.READ_AS_STREAMING, true)
       .option(FlinkOptions.READ_START_COMMIT, "earliest")
       .option(FlinkOptions.OPERATION, WriteOperationType.INSERT)
       .option(FlinkOptions.WRITE_BULK_INSERT_SHUFFLE_BY_PARTITION, true)
       .option(FlinkOptions.TABLE_TYPE, HoodieTableType.COPY_ON_WRITE)
       .option(FlinkOptions.BUCKET_ASSIGN_TASKS, 10)
       .option(FlinkOptions.COMPACTION_ASYNC_ENABLED, true)
       .option(FlinkOptions.COMPACTION_DELTA_COMMITS, 1)
       .option(FlinkOptions.READ_STREAMING_CHECK_INTERVAL, 10)
       //    .option(FlinkOptions.INSERT_CLUSTER, true)
       .option(FlinkOptions.RETRY_TIMES, 5)
       .option(FlinkOptions.INSERT_CLUSTER, true)
       .option(FlinkOptions.WRITE_TASKS, 10)
       .option(FlinkOptions.HIVE_SYNC_ENABLED, true)
       .option(FlinkOptions.HIVE_SYNC_AUTO_CREATE_DB, true)
       .option(FlinkOptions.HIVE_SYNC_DB, "ods")
       .option(FlinkOptions.HIVE_SYNC_TABLE, sinkTableName)
       .option(FlinkOptions.HIVE_SYNC_MODE, "hms")
       .option(FlinkOptions.HIVE_SYNC_METASTORE_URIS, "thrift://dev32:9083")
       .option(FlinkOptions.HIVE_SYNC_JDBC_URL, "jdbc:hive2://dev32:10000")
       .option(FlinkOptions.HIVE_STYLE_PARTITIONING, true)
       .option(FlinkOptions.HIVE_SYNC_SUPPORT_TIMESTAMP, true)
       .partitionField("tenantId", "fmy", "fmm", "fmd")
       //      .option(FlinkOptions.INDEX_GLOBAL_ENABLED, true)
       .end`
    
   ![image](https://user-images.githubusercontent.com/13082598/163804951-28bb43b7-d75b-4b30-bc1a-8cbfacb89b9f.png)
   ![image](https://user-images.githubusercontent.com/13082598/163805146-700dea3a-2813-4bef-92e0-0218e017505f.png)
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org