You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "sivabalan narayanan (Jira)" <ji...@apache.org> on 2021/11/10 03:53:00 UTC
[jira] [Resolved] (HUDI-2579) Deltastreamer checkpoint metadata is
not merged from previous commit instant
[ https://issues.apache.org/jira/browse/HUDI-2579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
sivabalan narayanan resolved HUDI-2579.
---------------------------------------
> Deltastreamer checkpoint metadata is not merged from previous commit instant
> ----------------------------------------------------------------------------
>
> Key: HUDI-2579
> URL: https://issues.apache.org/jira/browse/HUDI-2579
> Project: Apache Hudi
> Issue Type: Sub-task
> Components: DeltaStreamer, Spark Integration, Writer Core
> Reporter: Dave Hagman
> Assignee: Dave Hagman
> Priority: Blocker
> Labels: pull-request-available
> Fix For: 0.10.0
>
>
> Non-deltastreamer writers are supposed to copy over checkpoint metadata from previous checkpoints if the config _*hoodie.write.meta.key.prefixes = 'deltastreamer.checkpoint.key'*_ is set. This does not happen which causes non-deltastreamer writers to corrupt the deltastreamer commits by essentially erasing the checkpoint metadata.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)