You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Zhu Zhu (Jira)" <ji...@apache.org> on 2022/06/13 09:24:00 UTC

[jira] [Updated] (FLINK-28018) the start index to create empty splits in BinaryInputFormat#createInputSplits is inappropriate

     [ https://issues.apache.org/jira/browse/FLINK-28018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Zhu Zhu updated FLINK-28018:
----------------------------
    Component/s: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
                     (was: API / Core)

> the start index to create empty splits in BinaryInputFormat#createInputSplits is inappropriate
> ----------------------------------------------------------------------------------------------
>
>                 Key: FLINK-28018
>                 URL: https://issues.apache.org/jira/browse/FLINK-28018
>             Project: Flink
>          Issue Type: Bug
>          Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
>    Affects Versions: 1.15.0, 1.14.4, 1.16.0
>            Reporter: zl
>            Priority: Major
>             Fix For: 1.16.0, 1.15.2
>
>
> when the number of created split is smaller than the minimum desired number of file splits, [BinaryInputFormat.java#L150|https://github.com/apache/flink/blob/master/flink-core/src/main/java/org/apache/flink/api/common/io/BinaryInputFormat.java#L150] use `{_}*files.size()*{_}` as the start index to create empty splits. That is inappropriate, the start index should be `{_}*inputSplits.size()*{_}`.  



--
This message was sent by Atlassian Jira
(v8.20.7#820007)