You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@flink.apache.org by "Zhu Zhu (Jira)" <ji...@apache.org> on 2022/01/15 03:57:00 UTC

[jira] [Closed] (FLINK-25034) Support flexible number of subpartitions in IntermediateResultPartition

     [ https://issues.apache.org/jira/browse/FLINK-25034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Zhu Zhu closed FLINK-25034.
---------------------------
    Fix Version/s: 1.15.0
       Resolution: Done

master/release-1.15:
856652435b2b65933c0b09e780a6229c62a854e4
56ba36db1e3edd0f1c24c53abb2b62cdfabd9bb1

> Support flexible number of subpartitions in IntermediateResultPartition
> -----------------------------------------------------------------------
>
>                 Key: FLINK-25034
>                 URL: https://issues.apache.org/jira/browse/FLINK-25034
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Runtime / Coordination
>            Reporter: Lijie Wang
>            Assignee: Lijie Wang
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.15.0
>
>
> Currently, when a task is deployed, it needs to know the parallelism of its consumer job vertex. This is because the consumer vertex parallelism is needed to decide the _numberOfSubpartitions_ of _PartitionDescriptor_ which is part of the {_}ResultPartitionDeploymentDescriptor{_}. The reason behind that is, at the moment, for one result partition, different subpartitions serve different consumer execution vertices. More specifically, one consumer execution vertex only consumes data from subpartition with the same index. 
> Considering a dynamic graph, the parallelism of a job vertex may not have been decided when its upstream vertices are deployed. To enable Flink to work in this case, we need a way to allow an execution vertex to run without knowing the parallelism of its consumer job vertices. One basic idea is to enable multiple subpartitions in one result partition to serve the same consumer execution vertex.
> To achieve this goal, we can set the number of subpartitions to be the *max parallelism* of the consumer job vertex. When the consumer vertex is deployed, it should be assigned with a subpartition range to consume.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)