You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Neil Conway (JIRA)" <ji...@apache.org> on 2016/02/11 20:47:18 UTC

[jira] [Created] (MESOS-4659) Consider how to handle orphaned tasks after master failover

Neil Conway created MESOS-4659:
----------------------------------

             Summary: Consider how to handle orphaned tasks after master failover
                 Key: MESOS-4659
                 URL: https://issues.apache.org/jira/browse/MESOS-4659
             Project: Mesos
          Issue Type: Bug
          Components: master
            Reporter: Neil Conway


If a framework becomes disconnected from the master, its tasks are killed after waiting for {{failover_timeout}}.

However, if a master failover occurs but a framework never reconnects to the new master, we never kill any of the tasks associated with that framework. These tasks remain orphaned and presumably would need to be manually removed by the operator.

We should consider whether to kill such orphaned tasks automatically, likely after waiting for some (framework-configurable?) timeout.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)