Posted to issues@mesos.apache.org by "Benjamin Mahler (JIRA)" <ji...@apache.org> on 2015/02/27 03:59:05 UTC
[jira] [Commented] (MESOS-2392) Rate limit slaves removals during master recovery.
[ https://issues.apache.org/jira/browse/MESOS-2392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14339653#comment-14339653 ]
Benjamin Mahler commented on MESOS-2392:
----------------------------------------
Reviews:
https://reviews.apache.org/r/31511/
https://reviews.apache.org/r/31512/
https://reviews.apache.org/r/31514/
https://reviews.apache.org/r/31515/
https://reviews.apache.org/r/31516/
> Rate limit slaves removals during master recovery.
> --------------------------------------------------
>
> Key: MESOS-2392
> URL: https://issues.apache.org/jira/browse/MESOS-2392
> Project: Mesos
> Issue Type: Improvement
> Components: master
> Reporter: Benjamin Mahler
> Assignee: Benjamin Mahler
> Labels: twitter
>
> Much like we rate limit slave removals in the common path (MESOS-1148), we need to rate limit slave removals that occur during master recovery. When a master recovers and is using a strict registry, slaves that do not re-register within a timeout will be removed.
> Currently there is a safeguard in place to abort when too many slaves have not re-registered. However, in the case of a transient partition, we don't want to remove large sections of slaves without rate limiting.
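
To illustrate the behavior the description asks for, here is a minimal sketch in Python (not Mesos code). It assumes removals are handed out in fixed slots, with `permits` removals allowed per `interval_secs`, and that a slave which re-registers while waiting for its slot is spared. The function name, parameters, slave ids and timings are all hypothetical, chosen only for illustration; the actual patches are in the reviews listed above.

```python
def plan_removals(recovered, reregistered_at, timeout_secs, permits, interval_secs):
    """Simulate rate-limited removal of slaves after master recovery.

    recovered:        slave ids read back from the registry (hypothetical ids)
    reregistered_at:  slave id -> time in seconds at which it re-registered
    Returns the (time_secs, slave_id) removals, in order.
    """
    never = float("inf")

    # Slaves that have not re-registered when the timeout fires become candidates.
    candidates = [s for s in recovered
                  if reregistered_at.get(s, never) > timeout_secs]

    removals = []
    for i, slave in enumerate(candidates):
        # Candidate i waits for its slot: `permits` removals per interval.
        slot = timeout_secs + (i // permits) * interval_secs
        # A slave that re-registers while waiting is spared
        # (its slot is still consumed in this simplified model).
        if reregistered_at.get(slave, never) <= slot:
            continue
        removals.append((slot, slave))
    return removals


# s1 re-registers before the timeout; s3 comes back 50 seconds after it,
# as it might once a transient partition heals.
print(plan_removals(["s1", "s2", "s3", "s4"], {"s1": 10, "s3": 650},
                    timeout_secs=600, permits=1, interval_secs=60))
# prints [(600, 's2'), (720, 's4')]
```

With one removal per 60 seconds, s3 is still waiting for its slot when it re-registers, so it survives. If all three candidates were removed at once when the timeout fired, s3 would be lost as well.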