Posted to commits@hudi.apache.org by "Prashant Wason (Jira)" <ji...@apache.org> on 2020/04/03 23:52:00 UTC

[jira] [Assigned] (HUDI-757) Add a command to hudi-cli to export commit metadata

     [ https://issues.apache.org/jira/browse/HUDI-757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Prashant Wason reassigned HUDI-757:
-----------------------------------

    Assignee: Prashant Wason

> Add a command to hudi-cli to export commit metadata
> ---------------------------------------------------
>
>                 Key: HUDI-757
>                 URL: https://issues.apache.org/jira/browse/HUDI-757
>             Project: Apache Hudi (incubating)
>          Issue Type: Improvement
>            Reporter: Prashant Wason
>            Assignee: Prashant Wason
>            Priority: Minor
>              Labels: pull-request-available
>   Original Estimate: 4h
>          Time Spent: 10m
>  Remaining Estimate: 3h 50m
>
> HUDI stores commit-related information in files within the .hoodie directory. Each commit / deltacommit / rollback / etc. creates one or more files. To keep the number of files from growing too large, older files are consolidated and moved into a commit archive, which stores multiple such files together in the HUDI log file format.
> When debugging issues or developing new features, it may be necessary to refer to the metadata of older commits / cleanups / rollbacks. There is no simple way to get this metadata from a production setup, especially from the archive files.
> This enhancement provides a hudi-cli command that exports metadata from HUDI commit archives.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)