Posted to dev@flink.apache.org by "Zheng Hu (Jira)" <ji...@apache.org> on 2022/05/19 07:22:00 UTC

[jira] [Created] (FLINK-27696) Add bin-pack strategy to split the whole bucket data files into several small splits for append-only table.

Zheng Hu created FLINK-27696:
--------------------------------

             Summary: Add bin-pack strategy to split the whole bucket data files into several small splits for append-only table.
                 Key: FLINK-27696
                 URL: https://issues.apache.org/jira/browse/FLINK-27696
             Project: Flink
          Issue Type: Sub-task
            Reporter: Zheng Hu


For an append-only table, we don't have to assign all of a bucket's data files to a single task. Instead, we can use an algorithm (such as bin-packing) to split the bucket's data files into multiple fragments and so improve job parallelism.
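
To make the idea concrete, here is a minimal sketch of one possible bin-packing approach. It is not the implementation proposed for this issue. It uses a simple order-preserving "next-fit" heuristic, and the names BinPackSketch, DataFile, pack and targetSplitBytes are all hypothetical, chosen only for illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: none of these types exist in Flink.
final class BinPackSketch {

    /** Hypothetical stand-in for a data file; only its size matters here. */
    public static final class DataFile {
        public final String name;
        public final long sizeBytes;

        public DataFile(String name, long sizeBytes) {
            this.name = name;
            this.sizeBytes = sizeBytes;
        }
    }

    /**
     * Groups the files of one bucket into splits using next-fit packing:
     * files are taken in their given order, and a new split is started
     * whenever adding the next file would exceed targetSplitBytes.
     * A file larger than the target ends up alone in its own split.
     */
    public static List<List<DataFile>> pack(List<DataFile> files, long targetSplitBytes) {
        if (targetSplitBytes <= 0) {
            throw new IllegalArgumentException("targetSplitBytes must be positive");
        }
        List<List<DataFile>> splits = new ArrayList<>();
        List<DataFile> current = new ArrayList<>();
        long currentBytes = 0;
        for (DataFile file : files) {
            // Close the current split if this file would overflow it.
            if (!current.isEmpty() && currentBytes + file.sizeBytes > targetSplitBytes) {
                splits.add(current);
                current = new ArrayList<>();
                currentBytes = 0;
            }
            current.add(file);
            currentBytes += file.sizeBytes;
        }
        if (!current.isEmpty()) {
            splits.add(current);
        }
        return splits;
    }
}
```

Each resulting split could then be handed to a different task, so one large bucket no longer limits the read to a single task. Next-fit is used here only because it keeps files in their original order. Other heuristics (for example first-fit decreasing) pack more tightly but reorder the files.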



--
This message was sent by Atlassian Jira
(v8.20.7#820007)