Posted to issues@flink.apache.org by "Ufuk Celebi (JIRA)" <ji...@apache.org> on 2015/11/11 10:07:10 UTC

[jira] [Commented] (FLINK-2993) Set default DelayBetweenExecutionRetries to 0

    [ https://issues.apache.org/jira/browse/FLINK-2993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15000134#comment-15000134 ] 

Ufuk Celebi commented on FLINK-2993:
------------------------------------

I agree that the 100 second default is too high. Let's go for something lower.

But bear in mind that a very low delay gives a strange user experience of its own when the restart does not work because task slots are still missing. If the retry is too aggressive, you will see redeployment failures.

For HA, we should keep a less aggressive default delay (for now). There, job recovery is delayed by this timeout as well, and the task managers have to be notified about the new leader and reconnect within this time.
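
To make the trade-off concrete, here is a toy simulation in plain Java. It is not Flink code: the class name, the method, and all numbers (retry count, reconnect time) are made up for illustration. It only assumes that each restart attempt waits for the configured delay and that redeployment succeeds once the task managers have reconnected and their slots are available again.

```java
public final class RestartDelaySketch {

    /**
     * Simulates restart attempts after a failure.
     *
     * @param delayMs               delay before each restart attempt (hypothetical setting)
     * @param maxRetries            number of restart attempts allowed
     * @param slotsAvailableAfterMs assumed time until task managers have reconnected
     * @return the 1-based attempt that succeeds, or -1 if all retries are used up
     */
    public static int firstSuccessfulAttempt(long delayMs, int maxRetries, long slotsAvailableAfterMs) {
        long now = 0; // simulated time since the failure, in milliseconds
        for (int attempt = 1; attempt <= maxRetries; attempt++) {
            now += delayMs; // wait for the configured delay before redeploying
            if (now >= slotsAvailableAfterMs) {
                return attempt; // slots are back, redeployment works
            }
            // otherwise: redeployment fails because of missing task slots
        }
        return -1; // retries exhausted before the slots came back
    }

    public static void main(String[] args) {
        long reconnectMs = 5_000; // assumed reconnect time, not a measured value
        int retries = 3;          // assumed retry count

        // Delay 0: all retries are burned immediately, the job fails for good.
        System.out.println(firstSuccessfulAttempt(0, retries, reconnectMs));
        // Delay 2 s: the first two attempts fail, the third one succeeds.
        System.out.println(firstSuccessfulAttempt(2_000, retries, reconnectMs));
        // Delay 10 s: the first attempt succeeds, but only after a long wait.
        System.out.println(firstSuccessfulAttempt(10_000, retries, reconnectMs));
    }
}
```

In this simplified model a delay of 0 exhausts all retries before any slot is back, which is the redeployment failure scenario described above, while a delay longer than the reconnect time succeeds on the first attempt at the cost of a longer wait.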

It would also be helpful to print this delay at the client when the job state changes to RESTARTING. What do you think?

> Set default DelayBetweenExecutionRetries to 0
> ---------------------------------------------
>
>                 Key: FLINK-2993
>                 URL: https://issues.apache.org/jira/browse/FLINK-2993
>             Project: Flink
>          Issue Type: Improvement
>          Components: Core
>    Affects Versions: 0.10
>            Reporter: Stephan Ewen
>            Assignee: Stephan Ewen
>             Fix For: 1.0
>
>
> The default value is too high and gives a strange user experience.


