You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Paul Lin (Jira)" <ji...@apache.org> on 2021/01/11 03:36:00 UTC
[jira] [Commented] (FLINK-20918) Avoid excessive flush of Hadoop
output stream
[ https://issues.apache.org/jira/browse/FLINK-20918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17262369#comment-17262369 ]
Paul Lin commented on FLINK-20918:
----------------------------------
I'd like to take this issue if we agree.
> Avoid excessive flush of Hadoop output stream
> ---------------------------------------------
>
> Key: FLINK-20918
> URL: https://issues.apache.org/jira/browse/FLINK-20918
> Project: Flink
> Issue Type: Bug
> Affects Versions: 1.12.0, 1.11.3
> Reporter: Paul Lin
> Priority: Major
>
> [HadoopRecoverableFsDataOutputStream#sync|https://github.com/apache/flink/blob/67d167ccd45046fc5ed222ac1f1e3ba5e6ec434b/flink-filesystems/flink-hadoop-fs/src/main/java/org/apache/flink/runtime/fs/hdfs/HadoopRecoverableFsDataOutputStream.java#L123] calls both `hflush` and `hsync`, whereas `hsync` is an enhanced version of `hflush`. We should remove the `hflush` call to avoid the excessive flush.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)