You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@aurora.apache.org by "Kai Huang (JIRA)" <ji...@apache.org> on 2017/07/06 19:01:00 UTC
[jira] [Updated] (AURORA-1940) aurora job restart request should be retryable

     [ https://issues.apache.org/jira/browse/AURORA-1940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kai Huang updated AURORA-1940:
------------------------------
    Description: 
There was a recent change to the Aurora client to provide "at most once" instead of "at least once" retries for non-idempotent operations. See:
https://github.com/apache/aurora/commit/f1e25375def5a047da97d8bdfb47a3a9101568f6

`aurora job restart` is a non-idempotent operation, thus it was not retried. When there is a transport exception, the operator has to babysit simple operations like aurora job restart if it were not retried. Compared to the requests that were causing problems (admin tasks, job creating, updates, etc.), restarts in general should be retried rather than erring on the side of caution.

  was:
There was a recent change to the Aurora client to provide "at most once" instead of "at least once" retries for non-idempotent operations. See:
https://github.com/apache/aurora/commit/f1e25375def5a047da97d8bdfb47a3a9101568f6

`aurora job restart` is a non-idempotent operation, thus it was not retried. However, during a scheduler failover, the operator has to babysit simple operations like aurora job restart if it were not retried. Compared to the requests that were causing problems (admin tasks, job creating, updates, etc.), restarts in general should be retried rather than erring on the side of caution.


> aurora job restart request should be retryable
> ----------------------------------------------
>
>                 Key: AURORA-1940
>                 URL: https://issues.apache.org/jira/browse/AURORA-1940
>             Project: Aurora
>          Issue Type: Task
>            Reporter: Kai Huang
>            Assignee: Kai Huang
>            Priority: Minor
>
> There was a recent change to the Aurora client to provide "at most once" instead of "at least once" retries for non-idempotent operations. See:
> https://github.com/apache/aurora/commit/f1e25375def5a047da97d8bdfb47a3a9101568f6
> `aurora job restart` is a non-idempotent operation, thus it was not retried. When there is a transport exception, the operator has to babysit simple operations like aurora job restart if it were not retried. Compared to the requests that were causing problems (admin tasks, job creating, updates, etc.), restarts in general should be retried rather than erring on the side of caution.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)