You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Eric Badger (JIRA)" <ji...@apache.org> on 2018/08/22 15:00:00 UTC

[jira] [Assigned] (YARN-7644) NM gets backed up deleting docker containers

     [ https://issues.apache.org/jira/browse/YARN-7644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Eric Badger reassigned YARN-7644:
---------------------------------

    Assignee: Chandni Singh  (was: Eric Badger)

[~csingh], assigned to you. Thanks for picking this up

> NM gets backed up deleting docker containers
> --------------------------------------------
>
>                 Key: YARN-7644
>                 URL: https://issues.apache.org/jira/browse/YARN-7644
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager
>            Reporter: Eric Badger
>            Assignee: Chandni Singh
>            Priority: Major
>              Labels: Docker
>
> We are sending a {{docker stop}} to the docker container with a timeout of 10 seconds when we shut down a container. If the container does not stop after 10 seconds then we force kill it. However, the {{docker stop}} command is a blocking call. So in cases where lots of containers don't go down with the initial SIGTERM, we have to wait 10+ seconds for the {{docker stop}} to return. This ties up the ContainerLaunch handler and so these kill events back up. It also appears to be backing up new container launches as well. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org