Posted to commits@hudi.apache.org by "Danny Chen (Jira)" <ji...@apache.org> on 2023/01/13 09:51:00 UTC

[jira] [Updated] (HUDI-5544) Improve log msgs during bulk insert

     [ https://issues.apache.org/jira/browse/HUDI-5544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Danny Chen updated HUDI-5544:
-----------------------------
    Fix Version/s: 0.13.0

> Improve log msgs during bulk insert
> -----------------------------------
>
>                 Key: HUDI-5544
>                 URL: https://issues.apache.org/jira/browse/HUDI-5544
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: flink
>            Reporter: Kenneth William Krugler
>            Priority: Minor
>              Labels: pull-request-available
>             Fix For: 0.13.0
>
>
> Currently, a log message that says "Creating new file for partition path" is generated every time the current partition changes, even when no new file is actually being created, which is confusing.
> Also, if data flows in from multiple topics with even a slight lag between them, the current partition can bounce back and forth when the event time is close to a partition boundary. In our case this generates upwards of 1M log messages, given that our data rate is between 5 and 20M records/second.
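
The behavior the description asks for can be illustrated with a small, self-contained sketch. This is not Hudi's actual bulk insert writer: the class `PartitionWriterSketch`, its `FileHandle` type, and the `write` method are hypothetical names invented for illustration, and the sketch assumes handles stay open once created. It only shows the idea of emitting the "Creating new file" message when a handle is really created, rather than on every partition switch.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical illustration only; not Hudi's actual writer code. */
public class PartitionWriterSketch {

    /** Hypothetical stand-in for a per-partition file handle. */
    static final class FileHandle {
        final String partitionPath;
        long recordCount = 0;

        FileHandle(String partitionPath) {
            this.partitionPath = partitionPath;
        }
    }

    // Assumption: handles are kept open and reused across partition switches.
    private final Map<String, FileHandle> openHandles = new HashMap<>();

    // Messages are collected in a list (instead of going to a logger)
    // so the behavior is easy to inspect.
    final List<String> logMessages = new ArrayList<>();

    void write(String partitionPath, String record) {
        FileHandle handle = openHandles.get(partitionPath);
        if (handle == null) {
            // Log only here, where a new file handle is actually created.
            handle = new FileHandle(partitionPath);
            openHandles.put(partitionPath, handle);
            logMessages.add("Creating new file for partition path " + partitionPath);
        }
        // Switching back to a known partition reuses the handle silently.
        handle.recordCount++;
    }

    public static void main(String[] args) {
        PartitionWriterSketch writer = new PartitionWriterSketch();
        // Event times near a partition boundary: the partition bounces back and forth.
        String[] partitions = {
            "2023/01/12", "2023/01/13", "2023/01/12", "2023/01/13", "2023/01/12"
        };
        for (String partition : partitions) {
            writer.write(partition, "record");
        }
        // Two messages are printed, one per partition, despite four partition switches.
        writer.logMessages.forEach(System.out::println);
    }
}
```

With the input above, the partition changes four times but only two files are created, so only two messages are logged. A writer that closes the previous handle on every partition change would need a different approach, such as rate-limiting the message.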


