You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Dominic Hamon (JIRA)" <ji...@apache.org> on 2014/09/08 20:07:31 UTC
[jira] [Updated] (MESOS-1466) Race between executor exited event
and launch task can cause overcommit of resources
[ https://issues.apache.org/jira/browse/MESOS-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dominic Hamon updated MESOS-1466:
---------------------------------
Sprint: Q3 Sprint 3, Q3 Sprint 4, Q3 Sprint 5 (was: Q3 Sprint 3, Q3 Sprint 4)
> Race between executor exited event and launch task can cause overcommit of resources
> ------------------------------------------------------------------------------------
>
> Key: MESOS-1466
> URL: https://issues.apache.org/jira/browse/MESOS-1466
> Project: Mesos
> Issue Type: Bug
> Components: allocation, master
> Reporter: Vinod Kone
> Assignee: Benjamin Mahler
> Labels: reliability
>
> The following sequence of events can cause an overcommit
> --> Launch task is called for a task whose executor is already running
> --> Executor's resources are not accounted for on the master
> --> Executor exits and the event is enqueued behind launch tasks on the master
> --> Master sends the task to the slave which needs to commit for resources for task and the (new) executor.
> --> Master processes the executor exited event and re-offers the executor's resources causing an overcommit of resources.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)