You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@oozie.apache.org by "Julian Endres (JIRA)" <ji...@apache.org> on 2016/12/12 09:38:59 UTC

[jira] [Updated] (OOZIE-2758) Improve documentation for retries

     [ https://issues.apache.org/jira/browse/OOZIE-2758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julian Endres updated OOZIE-2758:
---------------------------------
    Description: 
In the oozie-site.xml the property oozie.action.retry.interval exists. 

It is described as "The interval between retries of an action in case of failure" without specifying a time unit. 

From the propertiey oozie.service.LiteWorkflowStoreService.user.retry.inteval which is described as "Automatic retry interval for workflow action is in minutes and the default value is 10 minutes." the user could assume that the property oozie.action.retry.interval is also minutes. However, as in 

https://github.com/apache/oozie/blob/master/core/src/main/java/org/apache/oozie/action/ActionExecutor.java

one comment states "defaultRetryInterval retry interval, in seconds.". 

In our environment the standard settings are used, and the application is exactly doing this: do a retry every 10 SECONDS (then suspend after max number of retries e.g. in the case of a distcp action). However the user might expect a retry every 10 minutes. 

Here is an excerpt from our log: 

Next Retry, Attempt Number [2] in [10.000] milliseconds

This should at least be correctly documented if not aligned to the same unit.  

  was:
In the oozie-site.xml the property oozie.action.retry.interval exists. 

It is described as "The interval between retries of an action in case of failure" without specifying a time unit. 

From the propertiey oozie.service.LiteWorkflowStoreService.user.retry.inteval which is described as "Automatic retry interval for workflow action is in minutes and the default value is 10 minutes." the user could assume that the property oozie.action.retry.interval is also minutes. However, as in 

https://github.com/apache/oozie/blob/master/core/src/main/java/org/apache/oozie/action/ActionExecutor.java

one comment states "defaultRetryInterval retry interval, in seconds.". 

In our environment the standard settings are used, and the application is exactly doint this: do a retry every 10 SECONDS (then suspend after max number of retries e.g. in the case of a distcp action). However the user might expect a retry every 10 minutes. 

Here is an excerpt from our log: 

Next Retry, Attempt Number [2] in [10.000] milliseconds

This should at least be correctly documented if not aligned to the same unit.  


> Improve documentation for retries
> ---------------------------------
>
>                 Key: OOZIE-2758
>                 URL: https://issues.apache.org/jira/browse/OOZIE-2758
>             Project: Oozie
>          Issue Type: Bug
>          Components: docs
>    Affects Versions: 4.3.0
>            Reporter: Julian Endres
>
> In the oozie-site.xml the property oozie.action.retry.interval exists. 
> It is described as "The interval between retries of an action in case of failure" without specifying a time unit. 
> From the propertiey oozie.service.LiteWorkflowStoreService.user.retry.inteval which is described as "Automatic retry interval for workflow action is in minutes and the default value is 10 minutes." the user could assume that the property oozie.action.retry.interval is also minutes. However, as in 
> https://github.com/apache/oozie/blob/master/core/src/main/java/org/apache/oozie/action/ActionExecutor.java
> one comment states "defaultRetryInterval retry interval, in seconds.". 
> In our environment the standard settings are used, and the application is exactly doing this: do a retry every 10 SECONDS (then suspend after max number of retries e.g. in the case of a distcp action). However the user might expect a retry every 10 minutes. 
> Here is an excerpt from our log: 
> Next Retry, Attempt Number [2] in [10.000] milliseconds
> This should at least be correctly documented if not aligned to the same unit.  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)