You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/09/07 03:10:00 UTC
[jira] [Updated] (HUDI-4794) add an option of the log file block size
[ https://issues.apache.org/jira/browse/HUDI-4794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated HUDI-4794:
---------------------------------
Labels: pull-request-available (was: )
> add an option of the log file block size
> ----------------------------------------
>
> Key: HUDI-4794
> URL: https://issues.apache.org/jira/browse/HUDI-4794
> Project: Apache Hudi
> Issue Type: Improvement
> Reporter: zhaoyangming
> Priority: Major
> Labels: pull-request-available
>
> on each hoodie log append, hdfs used will be increased with the length of the block(512M), not teh actual data length().
> Consider in a scenario,I use many writers to append concurrently to a large number of files(bucket file),but each time I append only 10 bytes.
> dfs used will be increased with the length of the block(512M),this will cause the datanode to report in-sufficient disk space on data write.
> even though it related to HDFS, We should also have the option to modify the configuration.It helps reduce the rate of increase during the du.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)