You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2024/01/30 15:44:00 UTC
[jira] [Updated] (FLINK-34311) Do not change min resource requirements when rescaling for adaptive scheduler
[ https://issues.apache.org/jira/browse/FLINK-34311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated FLINK-34311:
-----------------------------------
Labels: pull-request-available (was: )
> Do not change min resource requirements when rescaling for adaptive scheduler
> -----------------------------------------------------------------------------
>
> Key: FLINK-34311
> URL: https://issues.apache.org/jira/browse/FLINK-34311
> Project: Flink
> Issue Type: Improvement
> Components: Kubernetes Operator
> Reporter: Gyula Fora
> Assignee: Gyula Fora
> Priority: Major
> Labels: pull-request-available
> Fix For: kubernetes-operator-1.8.0
>
>
> when applying the rescale api to change parallelism we should not change the min parallelism.
> The problem currently is that if we cannot aquire the new resources within {{jobmanager.adaptive-scheduler.resource-wait-timeout}} the job will completely fail
> The {{jobmanager.adaptive-scheduler.resource-stabilization-timeout}} still allows us to wait for quite long if necessary to get the target parallelism but failing completely because of the wait timeout seems very unfortunate
> It's best to keep the min resources unchanged and let the adaptive scheduler take care of the parallelism changes together with the timeout settings.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)