You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@oozie.apache.org by "Andras Piros (JIRA)" <ji...@apache.org> on 2016/12/12 09:25:58 UTC

[jira] [Commented] (OOZIE-2758) Improve documentation for retries

    [ https://issues.apache.org/jira/browse/OOZIE-2758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15741425#comment-15741425 ] 

Andras Piros commented on OOZIE-2758:
-------------------------------------

[~julianendres] thanks for pointing that out!

For {{4.3.0}} I think changing the documentation is enough. For {{5.0.0}} however I'd prefer also to rename all the properties handling time units as follows:
* {{oozie.action.retry.interval}} -> {{oozie.action.retry.interval.seconds}}
* {{oozie.service.LiteWorkflowStoreService.user.retry.inteval}} -> {{oozie.service.LiteWorkflowStoreService.user.retry.inteval.minutes}}

Opening another JIRA for this.

> Improve documentation for retries
> ---------------------------------
>
>                 Key: OOZIE-2758
>                 URL: https://issues.apache.org/jira/browse/OOZIE-2758
>             Project: Oozie
>          Issue Type: Bug
>          Components: docs
>    Affects Versions: 4.3.0
>            Reporter: Julian Endres
>
> In the oozie-site.xml the property oozie.action.retry.interval exists. 
> It is described as "The interval between retries of an action in case of failure" without specifying a time unit. 
> From the propertiey oozie.service.LiteWorkflowStoreService.user.retry.inteval which is described as "Automatic retry interval for workflow action is in minutes and the default value is 10 minutes." the user could assume that the property oozie.action.retry.interval is also minutes. However, as in 
> https://github.com/apache/oozie/blob/master/core/src/main/java/org/apache/oozie/action/ActionExecutor.java
> one comment states "defaultRetryInterval retry interval, in seconds.". 
> In our environment the standard settings are used, and the application is exactly doint this: do a retry every 10 SECONDS (then suspend after max number of retries e.g. in the case of a distcp action). However the user might expect a retry every 10 minutes. 
> Here is an excerpt from our log: 
> Next Retry, Attempt Number [2] in [10.000] milliseconds
> This should at least be correctly documented if not aligned to the same unit.  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)