Posted to commits@hudi.apache.org by "michael1991 (via GitHub)" <gi...@apache.org> on 2023/03/08 08:19:25 UTC

[GitHub] [hudi] michael1991 commented on issue #8121: [SUPPORT] MOR Table Duplicated Records Found

michael1991 commented on issue #8121:
URL: https://github.com/apache/hudi/issues/8121#issuecomment-1459708223

   > Can you give the job configurations here? cc @nsivabalan , maybe you can take a look~
   
   Sure, please see the configurations below:
   ```scala
   val COMMON_HUDI_CONF_MAP = Map(RECORDKEY_FIELD.key() -> "id", PRECOMBINE_FIELD.key() -> "id",
       SCHEMA_EVOLUTION_ENABLED.key() -> "true", DATABASE_NAME.key() -> "database",
       COMBINE_BEFORE_UPSERT.key() -> "false", EMBEDDED_TIMELINE_SERVER_REUSE_ENABLED.key() -> "true",
       INSERT_PARALLELISM_VALUE.key() -> "5", UPSERT_PARALLELISM_VALUE.key() -> "5",
       CLEANER_COMMITS_RETAINED.key() -> "2", ASYNC_CLEAN.key() -> "true", 
       PARTITIONPATH_FIELD.key() -> "date,hour", TBL_NAME.key() -> TBL_LOG_INCREMENT_DETAILS_NAME,
       TABLE_TYPE.key() -> MOR_TABLE_TYPE_OPT_VAL, OPERATION.key() -> UPSERT_OPERATION_OPT_VAL,
       WRITE_PAYLOAD_CLASS_NAME.key() -> CUSTOM_PAYLOAD_CLASS, INLINE_COMPACT.key() -> "true")
   
   // This is the current configuration. The previous one differed only in using SCHEDULE_INLINE_COMPACT instead of INLINE_COMPACT.
   ```
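
   To make the difference between the two runs explicit, here is a minimal sketch in plain Scala (no Hudi dependency) that diffs two option maps. It assumes that `INLINE_COMPACT` resolves to `hoodie.compact.inline` and `SCHEDULE_INLINE_COMPACT` to `hoodie.compact.schedule.inline`, and that the previous run set the latter to "true". The object name `CompactionConfigDiff` and the reduced maps are hypothetical, for illustration only.

   ```scala
   object CompactionConfigDiff {
     // Hypothetical subset of the options above, written with plain string keys.
     val common: Map[String, String] = Map(
       "hoodie.datasource.write.table.type" -> "MERGE_ON_READ",
       "hoodie.datasource.write.operation"  -> "upsert",
       "hoodie.combine.before.upsert"       -> "false"
     )

     // Current run: inline compaction enabled (assumed key for INLINE_COMPACT).
     val current: Map[String, String] = common + ("hoodie.compact.inline" -> "true")

     // Previous run: assumed to have set only SCHEDULE_INLINE_COMPACT to "true".
     val previous: Map[String, String] = common + ("hoodie.compact.schedule.inline" -> "true")

     // Keys that are missing on one side or carry different values.
     def diff(a: Map[String, String], b: Map[String, String]): Set[String] =
       (a.keySet ++ b.keySet).filter(k => a.get(k) != b.get(k))

     def main(args: Array[String]): Unit =
       diff(previous, current).toSeq.sorted.foreach { k =>
         println(s"$k: previous=${previous.get(k)} current=${current.get(k)}")
       }
   }
   ```

   Under these assumptions, only the two compaction keys differ between the runs; every other option is identical.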

