Posted to commits@hudi.apache.org by "sivabalan narayanan (Jira)" <ji...@apache.org> on 2022/03/03 15:52:00 UTC

[jira] [Closed] (HUDI-2364) Run compaction without user schema file provided

     [ https://issues.apache.org/jira/browse/HUDI-2364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

sivabalan narayanan closed HUDI-2364.
-------------------------------------
    Resolution: Fixed

> Run compaction without user schema file provided
> ------------------------------------------------
>
>                 Key: HUDI-2364
>                 URL: https://issues.apache.org/jira/browse/HUDI-2364
>             Project: Apache Hudi
>          Issue Type: New Feature
>            Reporter: Wenning Ding
>            Priority: Major
>              Labels: core-flow-ds, pull-request-available, sev:normal
>             Fix For: 0.11.0
>
>
> Currently, to run Hudi compaction manually, customers have to supply the avsc file for the data schema themselves,
>  e.g. in the Hudi CLI:
>  
> {code:java}
> compaction run --compactionInstant 20201203005420 \
>   --parallelism 2 --sparkMemory 2G \
>   --schemaFilePath s3://xxx/hudi/mor_schema.avsc \
>   --propsFilePath file:///home/hadoop/config.properties --retry 1
> {code}
> Requiring customers to provide the avsc file is not a good option. Some customers don’t know how to generate this schema file, and others pass the wrong schema file and run into other exceptions. We should handle this logic inside Hudi if possible.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)