You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Ian Downes (JIRA)" <ji...@apache.org> on 2015/02/27 21:53:04 UTC

[jira] [Updated] (MESOS-2421) Processes can be stuck in D state and block container destroy

     [ https://issues.apache.org/jira/browse/MESOS-2421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ian Downes updated MESOS-2421:
------------------------------
    Summary: Processes can be stuck in D state and block container destroy  (was: Processes can be stuck in D state and block destroy)

> Processes can be stuck in D state and block container destroy
> -------------------------------------------------------------
>
>                 Key: MESOS-2421
>                 URL: https://issues.apache.org/jira/browse/MESOS-2421
>             Project: Mesos
>          Issue Type: Bug
>          Components: isolation
>    Affects Versions: 0.21.1
>         Environment: CentOS, 3.10 kernel
>            Reporter: Ian Downes
>              Labels: twitter
>
> We've observed processes getting stuck in D state (uninterruptible sleep) when using the cpu isolator. This prevents the MesosContainerizer launcher from killing all container processes and blocks destroying the container. 
> It appears to be a kernel scheduler bug: the processes can be unstuck by modifying the cpu.cfs_quota_us for the cpu cgroup. This seems to run the processes, deliver the kill signal, and they exit.
> We should implement this workaround in the launcher destroy path when processes are observed to be in D state.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)