You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Tzu-Li (Gordon) Tai (JIRA)" <ji...@apache.org> on 2019/07/04 07:38:00 UTC
[jira] [Assigned] (FLINK-13093) Provide an easy way to modify max
parallelism using the State Processor API
[ https://issues.apache.org/jira/browse/FLINK-13093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tzu-Li (Gordon) Tai reassigned FLINK-13093:
-------------------------------------------
Assignee: Seth Wiesman
> Provide an easy way to modify max parallelism using the State Processor API
> ---------------------------------------------------------------------------
>
> Key: FLINK-13093
> URL: https://issues.apache.org/jira/browse/FLINK-13093
> Project: Flink
> Issue Type: Sub-task
> Reporter: Tzu-Li (Gordon) Tai
> Assignee: Seth Wiesman
> Priority: Major
>
> Currently, the State Processor API does not easily allow one to modify the max parallelism of a job. To do that with the current state of the API, one would have to read all state that exists in a loaded {{ExistingSavepoint}}, extract them as {{DataSet}}s, and then create a {{NewSavepoint}} that has the new max parallelism with all the extracted data sets bootstrapped as new state.
> It would be nice if the user could simply do something like the following (API is TBD):
> {code}
> ExistingSavepoint savepoint = Savepoint.load("path", env, backend);
> savepoint.modifyMaxParallelism("newPath", newParallelism);
> {code}
> Under the hood, a batch job is launched that repartitions all existing operator state using the new max parallelism, and writes to the re-partitioned state data into the new savepoint path.
> The API can be designed such that the user cannot modify the max parallelism and add / new operators at the same time, to not over complicate the batch job.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)