You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/04/22 19:46:49 UTC

[GitHub] [hudi] qjqqyy commented on issue #5101: [SUPPORT] Deltastreamer: setting hoodie.datasource.hive_sync.partition_fields to blank results in empty partition being used in hive sync

qjqqyy commented on issue #5101:
URL: https://github.com/apache/hudi/issues/5101#issuecomment-1106809193

   hi @codope sure I would like to work on the fix, please add me (apache username jqi) as a contributor.
   
   So to to summarise the problem at hand, **ComplexKeyGenerator + no partition columns + HiveSync results in a broken Hive table**.
   * for non-partitoned table, ComplexKeyGenerator requires `hoodie.datasource.write.partitionpath.field` to be set to empty string
   * when `hoodie.datasource.hive_sync.partition_fields` is not explicitly set, the inference function sets it to empty string
     - as OP is using Hudi 0.10.1, which does not auto-infer, it is just a minor annoyance of having to configure things inconsistently
   * there does not exist a combination of flags to produce a working hive table
   
   I think the root cause is actually inconsistent handling of the degenerate case (empty string) between TypedProperties#getStringList and ComplexKeyGenerator.
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org