You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Wenning Ding (Jira)" <ji...@apache.org> on 2021/08/25 23:24:00 UTC

[jira] [Created] (HUDI-2364) Run compaction without user schema file provided

Wenning Ding created HUDI-2364:
----------------------------------

             Summary: Run compaction without user schema file provided
                 Key: HUDI-2364
                 URL: https://issues.apache.org/jira/browse/HUDI-2364
             Project: Apache Hudi
          Issue Type: New Feature
            Reporter: Wenning Ding


Currently to run Hudi compaction manually, customers have to pass the avsc file of data schema by themselves,
e.g. in Hudi CLI,

 

{{}}
{code:java}
compaction run --compactionInstant 20201203005420 \ --parallelism 2 --sparkMemory 2G \ --schemaFilePath s3://wenningd-emr-dev/oncall/hudi/mor_delete_2_schema.avsc \ --propsFilePath file:///home/hadoop/config.properties --retry 1
{code}
Let customers provide avsc file is not a good option. Some customers don’t know how to generate this schema file, and some customers pass the wrong schema file and get other exceptions. We should handle this logic inside Hudi if possible.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)