Posted to issues@flink.apache.org by "Chesnay Schepler (JIRA)" <ji...@apache.org> on 2019/07/03 10:15:01 UTC

[jira] [Updated] (FLINK-13060) FailoverStrategies should respect restart constraints

     [ https://issues.apache.org/jira/browse/FLINK-13060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chesnay Schepler updated FLINK-13060:
-------------------------------------
    Description: 
RestartStrategies can define their own restrictions on whether a job can be restarted. For example, they could count the total number of failures or observe failure rates.

FailoverStrategies are used for partial restarts of jobs, and currently largely bypass the restrictions defined by the restart strategies.

My proposal is the following:

Introduce a new method into the {{RestartStrategy}} interface to notify the strategy of failed task executions. Currently, strategies implicitly handle this in {{RestartStrategy#restart}}, so migrating our existing strategies should be trivial.

Next, before calling {{RestartStrategy#restart}}, inform the strategy about the task failure. This retains existing behavior.

  was:
RestartStrategies can define their own restrictions on whether a job can be restarted. For example, they could count the total number of failures or observe failure rates.

FailoverStrategies are used for partial restarts of jobs, and currently largely bypass the restrictions defined by the restart strategies.


> FailoverStrategies should respect restart constraints
> -----------------------------------------------------
>
>                 Key: FLINK-13060
>                 URL: https://issues.apache.org/jira/browse/FLINK-13060
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Coordination
>            Reporter: Chesnay Schepler
>            Assignee: Chesnay Schepler
>            Priority: Major
>             Fix For: 1.9.0
>
>
> RestartStrategies can define their own restrictions on whether a job can be restarted. For example, they could count the total number of failures or observe failure rates.
> FailoverStrategies are used for partial restarts of jobs, and currently largely bypass the restrictions defined by the restart strategies.
> My proposal is the following:
> Introduce a new method into the {{RestartStrategy}} interface to notify the strategy of failed task executions. Currently, strategies implicitly handle this in {{RestartStrategy#restart}}, so migrating our existing strategies should be trivial.
> Next, before calling {{RestartStrategy#restart}}, inform the strategy about the task failure. This retains existing behavior.
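
The proposal above could be sketched roughly as follows. This is an illustration only, not Flink's actual API: the {{SimpleRestartStrategy}} interface, the {{notifyFailure}} method name, the {{FixedAttemptStrategy}} class and the {{handleFailure}} helper are all hypothetical stand-ins. The idea shown is that failure counting moves out of {{restart}} into a separate notification, so that both full restarts and partial failovers go through the same constraint check.

```java
import java.util.ArrayList;
import java.util.List;

final class RestartConstraintSketch {

    /** Hypothetical, simplified stand-in for a restart strategy interface. */
    interface SimpleRestartStrategy {
        /** Proposed addition (name assumed): record one failed task execution. */
        void notifyFailure();

        /** Whether the strategy's constraints still allow a restart. */
        boolean canRestart();

        /** Performs the restart; it no longer counts failures itself. */
        void restart(Runnable restartAction);
    }

    /** Hypothetical strategy that tolerates at most maxFailures failures. */
    static final class FixedAttemptStrategy implements SimpleRestartStrategy {
        private final int maxFailures;
        private int failures;

        FixedAttemptStrategy(int maxFailures) {
            this.maxFailures = maxFailures;
        }

        @Override
        public void notifyFailure() {
            failures++;
        }

        @Override
        public boolean canRestart() {
            return failures <= maxFailures;
        }

        @Override
        public void restart(Runnable restartAction) {
            restartAction.run();
        }
    }

    /**
     * Shared failure path (hypothetical helper): notify first, then check the
     * constraint, then recover. The recovery action may be a full job restart
     * or a partial failover; both are counted the same way.
     */
    static boolean handleFailure(SimpleRestartStrategy strategy, Runnable recovery) {
        strategy.notifyFailure();
        if (!strategy.canRestart()) {
            return false;
        }
        strategy.restart(recovery);
        return true;
    }

    public static void main(String[] args) {
        SimpleRestartStrategy strategy = new FixedAttemptStrategy(2);
        List<String> log = new ArrayList<>();

        System.out.println(handleFailure(strategy, () -> log.add("full restart")));
        System.out.println(handleFailure(strategy, () -> log.add("partial failover")));
        // The third failure exceeds the limit, even though it is only a partial failover.
        System.out.println(handleFailure(strategy, () -> log.add("partial failover")));
        System.out.println(log);
    }
}
```

In this sketch the third failure is rejected because partial failovers count against the same limit as full restarts, which is the behavior the issue asks for.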


