You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@aurora.apache.org by "Moses Nakamura (JIRA)" <ji...@apache.org> on 2015/03/14 04:44:38 UTC
[jira] [Comment Edited] (AURORA-894) Server updater should watch
healthy instances
[ https://issues.apache.org/jira/browse/AURORA-894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14361577#comment-14361577 ]
Moses Nakamura edited comment on AURORA-894 at 3/14/15 3:43 AM:
----------------------------------------------------------------
I took a first stab at the python bits here: https://reviews.apache.org/r/31104/ but I didn't have time to write tests or do any of the java bits. Let me know if I can do anything to help.
was (Author: moses.nakamura):
I took a first stab at the python bits here: https://reviews.apache.org/r/31104/ but I didn't have time to write tests or do any of the java bits. Let me know if you need any help.
> Server updater should watch healthy instances
> ---------------------------------------------
>
> Key: AURORA-894
> URL: https://issues.apache.org/jira/browse/AURORA-894
> Project: Aurora
> Issue Type: Task
> Components: Scheduler
> Reporter: Maxim Khutornenko
> Assignee: Maxim Khutornenko
>
> Instead of starting the {{minWaitInInstanceRunningMs}} (aka {{watch_secs}}) countdown when an instance reaches RUNNING state, the updater should rely on the first successful health check instead. This will potentially speed up updates as the {{minWaitInInstanceRunningMs}} will no longer have to be chosen based on the worst observed instance startup/warmup delay but rather as a desired health check duration according to the following formula:
> {noformat}
> minWaitInInstanceRunningMs = interval_secs x num_desired_healthchecks x 1000
> {noformat}
> where:
> {{interval_secs}} - https://github.com/apache/incubator-aurora/blob/master/docs/configuration-reference.md#healthcheckconfig-objects
> {{num_desired_healthchecks}} - the desired number of OK health checks to observe before declaring an instance updated successfully
>
> The above would allow every instance to start watching interval depending on the individual instance performance and potentially exit updater earlier. This feature requires AURORA-279.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)