You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Ethan Guo (Jira)" <ji...@apache.org> on 2023/02/07 22:53:00 UTC
[jira] [Updated] (HUDI-4967) Improve docs for meta sync with TimestampBasedKeyGenerator
[ https://issues.apache.org/jira/browse/HUDI-4967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ethan Guo updated HUDI-4967:
----------------------------
Fix Version/s: 0.14.0
(was: 0.13.0)
> Improve docs for meta sync with TimestampBasedKeyGenerator
> ----------------------------------------------------------
>
> Key: HUDI-4967
> URL: https://issues.apache.org/jira/browse/HUDI-4967
> Project: Apache Hudi
> Issue Type: Improvement
> Reporter: Ethan Guo
> Assignee: Jonathan Vexler
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.14.0
>
>
> Related fix: HUDI-4966
> We need to add docs on how to properly set the meta sync configuration, especially the hoodie.datasource.hive_sync.partition_value_extractor, in [https://hudi.apache.org/docs/key_generation] (for different Hudi versions, the config can be different). Check the ticket above and PR description of [https://github.com/apache/hudi/pull/6851] for more details.
> We should also add the migration setup on the key generation page as well: [https://hudi.apache.org/releases/release-0.12.0/#configuration-updates]
> * {{{}hoodie.datasource.hive_sync.partition_value_extractor{}}}: This config is used to extract and transform partition value during Hive sync. Its default value has been changed from {{SlashEncodedDayPartitionValueExtractor}} to {{{}MultiPartKeysValueExtractor{}}}. If you relied on the previous default value (i.e., have not set it explicitly), you are required to set the config to {{{}org.apache.hudi.hive.SlashEncodedDayPartitionValueExtractor{}}}. From this release, if this config is not set and Hive sync is enabled, then partition value extractor class will be *automatically inferred* on the basis of number of partition fields and whether or not hive style partitioning is enabled.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)