Posted to issues@mesos.apache.org by "Alexander Rukletsov (JIRA)" <ji...@apache.org> on 2015/12/15 17:51:46 UTC

[jira] [Updated] (MESOS-3775) MasterAllocatorTest.SlaveLost is slow

     [ https://issues.apache.org/jira/browse/MESOS-3775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alexander Rukletsov updated MESOS-3775:
---------------------------------------
    Issue Type: Improvement  (was: Bug)

> MasterAllocatorTest.SlaveLost is slow
> -------------------------------------
>
>                 Key: MESOS-3775
>                 URL: https://issues.apache.org/jira/browse/MESOS-3775
>             Project: Mesos
>          Issue Type: Improvement
>          Components: technical debt, test
>            Reporter: Alexander Rukletsov
>            Priority: Minor
>              Labels: mesosphere, tech-debt
>
> {{MasterAllocatorTest.SlaveLost}} takes more than {{5s}} to complete. A brief look at the code suggests that the stopped agent does not quit immediately (and hence its resources are not released by the allocator) because [it waits for the executor to terminate|https://github.com/apache/mesos/blob/master/src/tests/master_allocator_tests.cpp#L717]. The {{5s}} timeout comes from the {{EXECUTOR_SHUTDOWN_GRACE_PERIOD}} agent constant.
> Possible solutions:
> * Do not wait until the stopped agent quits (can be flaky, needs deeper analysis).
> * Decrease the agent's {{executor_shutdown_grace_period}} flag.
> * Terminate the executor faster (this may require some refactoring, since the executor driver is created in the {{TestContainerizer}} and we do not have direct access to it).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)