Posted to issues@flink.apache.org by "Paul Lin (Jira)" <ji...@apache.org> on 2021/01/11 03:36:00 UTC

[jira] [Commented] (FLINK-20918) Avoid excessive flush of Hadoop output stream

    [ https://issues.apache.org/jira/browse/FLINK-20918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17262369#comment-17262369 ] 

Paul Lin commented on FLINK-20918:
----------------------------------

I'd like to take this issue if we agree.

> Avoid excessive flush of Hadoop output stream
> ---------------------------------------------
>
>                 Key: FLINK-20918
>                 URL: https://issues.apache.org/jira/browse/FLINK-20918
>             Project: Flink
>          Issue Type: Bug
>    Affects Versions: 1.12.0, 1.11.3
>            Reporter: Paul Lin
>            Priority: Major
>
> [HadoopRecoverableFsDataOutputStream#sync|https://github.com/apache/flink/blob/67d167ccd45046fc5ed222ac1f1e3ba5e6ec434b/flink-filesystems/flink-hadoop-fs/src/main/java/org/apache/flink/runtime/fs/hdfs/HadoopRecoverableFsDataOutputStream.java#L123] calls both `hflush` and `hsync`. Since `hsync` already does everything `hflush` does (flushing the client buffer so that new readers see the data) and additionally syncs the data to disk, the preceding `hflush` call is redundant. We should remove the `hflush` call to avoid the excessive flush.
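
A minimal sketch of the proposed change, not the actual Flink code. To stay self-contained it uses a hypothetical `Syncable` stand-in that mirrors only the `hflush`/`hsync` pair of Hadoop's interface, plus a hypothetical `CountingStream` stub that counts calls instead of talking to HDFS. The names `SyncSketch`, `syncWithExtraFlush` and `syncOnly` are likewise made up for illustration.

```java
import java.io.IOException;

public class SyncSketch {

    /** Hypothetical stand-in mirroring hflush/hsync from Hadoop's Syncable. */
    interface Syncable {
        void hflush() throws IOException;
        void hsync() throws IOException;
    }

    /** Stub that only counts calls; a real stream would contact the DataNodes. */
    static final class CountingStream implements Syncable {
        int hflushCalls;
        int hsyncCalls;

        @Override
        public void hflush() {
            hflushCalls++;
        }

        @Override
        public void hsync() {
            hsyncCalls++;
        }
    }

    /** Behaviour described in the issue: hflush followed by hsync. */
    static void syncWithExtraFlush(Syncable out) throws IOException {
        out.hflush();
        out.hsync();
    }

    /** Proposed behaviour: rely on hsync alone. */
    static void syncOnly(Syncable out) throws IOException {
        out.hsync();
    }

    public static void main(String[] args) throws IOException {
        CountingStream before = new CountingStream();
        syncWithExtraFlush(before);

        CountingStream after = new CountingStream();
        syncOnly(after);

        System.out.println("before: hflush=" + before.hflushCalls + ", hsync=" + before.hsyncCalls);
        System.out.println("after:  hflush=" + after.hflushCalls + ", hsync=" + after.hsyncCalls);
    }
}
```

With the stub, each `sync` issues one call to the stream instead of two. How much this saves on a real cluster depends on the Hadoop client implementation and is not measured here.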



--
This message was sent by Atlassian Jira
(v8.3.4#803005)