Posted to issues@mesos.apache.org by "Benjamin Mahler (JIRA)" <ji...@apache.org> on 2015/02/24 03:23:11 UTC

[jira] [Created] (MESOS-2392) Rate limit slaves that are removed during master recovery.

Benjamin Mahler created MESOS-2392:
--------------------------------------

             Summary: Rate limit slaves that are removed during master recovery.
                 Key: MESOS-2392
                 URL: https://issues.apache.org/jira/browse/MESOS-2392
             Project: Mesos
          Issue Type: Improvement
          Components: master
            Reporter: Benjamin Mahler
            Assignee: Benjamin Mahler


Much like we rate limit slave removals in the common path (MESOS-1148), we need to rate limit slave removals that occur during master recovery. When a master recovers while using a strict registry, slaves that do not re-register within a timeout are removed.

Currently there is a safeguard in place to abort when too many slaves have not re-registered. However, in the case of a transient partition, we don't want to remove a large number of slaves without rate limiting.
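
To illustrate the idea, here is a minimal sketch of rate-limited removal after the re-registration timeout. It is not Mesos code: the names Removal, scheduleRemovals and applySchedule, and the fixed "one removal per interval" policy, are assumptions made for this example only. Slaves that re-register before their scheduled removal slot are kept.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical types and functions for illustration; not part of Mesos.
struct Removal {
  std::string slaveId;
  double removeAtSecs;  // Seconds after the re-registration timeout expired.
};

// Spreads removals out so that at most one slave is removed per
// `intervalSecs`, instead of removing every slave at the timeout.
std::vector<Removal> scheduleRemovals(
    const std::vector<std::string>& notReregistered,
    double intervalSecs)
{
  assert(intervalSecs > 0.0);

  std::vector<Removal> schedule;
  schedule.reserve(notReregistered.size());

  for (std::size_t i = 0; i < notReregistered.size(); ++i) {
    schedule.push_back(
        Removal{notReregistered[i], static_cast<double>(i) * intervalSecs});
  }

  return schedule;
}

// Returns the slaves that actually get removed. A slave that re-registers
// at or before its scheduled removal time is skipped (its removal is
// effectively cancelled).
std::vector<std::string> applySchedule(
    const std::vector<Removal>& schedule,
    const std::map<std::string, double>& reregisteredAtSecs)
{
  std::vector<std::string> removed;

  for (const Removal& removal : schedule) {
    auto it = reregisteredAtSecs.find(removal.slaveId);
    if (it != reregisteredAtSecs.end() &&
        it->second <= removal.removeAtSecs) {
      continue;  // Slave came back before its removal slot.
    }
    removed.push_back(removal.slaveId);
  }

  return removed;
}
```

With four slaves and a 60 second interval, a partition that heals 30 seconds after the timeout costs only the first slave in this sketch; without rate limiting all four would have been removed at the timeout.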



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)