You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Alexander Rukletsov (JIRA)" <ji...@apache.org> on 2015/11/10 01:23:11 UTC

[jira] [Created] (MESOS-3865) Failover and recovery in presence of Quota

Alexander Rukletsov created MESOS-3865:
------------------------------------------

             Summary: Failover and recovery in presence of Quota
                 Key: MESOS-3865
                 URL: https://issues.apache.org/jira/browse/MESOS-3865
             Project: Mesos
          Issue Type: Epic
          Components: allocation, master
            Reporter: Alexander Rukletsov
            Assignee: Alexander Rukletsov


The presence of quota in the cluster changes 

Quota complicates master failover and recovery in several ways. The new master should determine if it is possible to satisfy the total quota and notify an operator in case it's not (imagine simultaneous failovers of multiple agents). The new master should hint the allocator how many agents might reconnect in the future to help it decide how to satisfy quota before the majority of agents reconnect.

The allocator interface should be updated with some sort of recovery information, which will allow it to react properly (e.g. seize offers and hold off resources for some time).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)