Posted to issues@mesos.apache.org by "Dominic Hamon (JIRA)" <ji...@apache.org> on 2014/09/08 20:07:31 UTC

[jira] [Updated] (MESOS-1466) Race between executor exited event and launch task can cause overcommit of resources

     [ https://issues.apache.org/jira/browse/MESOS-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dominic Hamon updated MESOS-1466:
---------------------------------
    Sprint: Q3 Sprint 3, Q3 Sprint 4, Q3 Sprint 5  (was: Q3 Sprint 3, Q3 Sprint 4)

> Race between executor exited event and launch task can cause overcommit of resources
> ------------------------------------------------------------------------------------
>
>                 Key: MESOS-1466
>                 URL: https://issues.apache.org/jira/browse/MESOS-1466
>             Project: Mesos
>          Issue Type: Bug
>          Components: allocation, master
>            Reporter: Vinod Kone
>            Assignee: Benjamin Mahler
>              Labels: reliability
>
> The following sequence of events can cause an overcommit of resources:
> --> Launch task is called for a task whose executor is already running.
> --> The executor's resources are not accounted for on the master.
> --> The executor exits, and the exited event is enqueued behind launch tasks on the master.
> --> The master sends the task to the slave, which needs to commit resources for both the task and the (new) executor.
> --> The master processes the executor exited event and re-offers the executor's resources, causing an overcommit of resources.
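
The sequence above can be sketched as a small simulation. This is an illustrative toy model only, not Mesos code: the event names, the variables, and the CPU figures are all hypothetical. It also assumes one reading of the second step, namely that the master does not charge the executor's resources again at launch because it believes the executor is still running.

```python
from collections import deque

# Hypothetical figures chosen only for illustration.
TOTAL_CPUS = 4.0      # CPUs on the slave
EXECUTOR_CPUS = 1.0   # CPUs held by the executor
TASK_CPUS = 2.0       # CPUs requested by the new task


def simulate_race():
    """Replay the race; return (cpus the master would offer, cpus really free)."""
    # Both sides start out agreeing: one executor is running.
    master_used = EXECUTOR_CPUS
    master_executor_running = True
    slave_used = EXECUTOR_CPUS
    slave_executor_running = True

    # The master's event queue, processed strictly in order.
    events = deque()

    # A launch for the already running executor is enqueued first.
    events.append("launch_task")

    # The executor exits on the slave; the notification lands behind the launch.
    slave_used -= EXECUTOR_CPUS
    slave_executor_running = False
    events.append("executor_exited")

    while events:
        event = events.popleft()
        if event == "launch_task":
            # Master: executor looks alive, so only the task is charged.
            if not master_executor_running:
                master_used += EXECUTOR_CPUS
            master_used += TASK_CPUS
            # Slave: executor is gone, so a new one is started with the task.
            if not slave_executor_running:
                slave_used += EXECUTOR_CPUS
                slave_executor_running = True
            slave_used += TASK_CPUS
        elif event == "executor_exited":
            # Master: recovers the old executor's resources for re-offer.
            master_used -= EXECUTOR_CPUS
            master_executor_running = False

    return TOTAL_CPUS - master_used, TOTAL_CPUS - slave_used


if __name__ == "__main__":
    offerable, free = simulate_race()
    print(f"master would offer {offerable} cpus, slave has {free} free")
    print(f"overcommitted by {offerable - free} cpus")
```

In this toy model the master ends up willing to offer more CPUs than the slave has free, and the gap equals the executor's allocation, because the new executor started by the slave is never charged on the master.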



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)