You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "dzcxzl (Jira)" <ji...@apache.org> on 2020/10/15 08:39:00 UTC

[jira] [Created] (YARN-10462) Configurable shutdown cleanup slop

dzcxzl created YARN-10462:
-----------------------------

             Summary: Configurable shutdown cleanup slop
                 Key: YARN-10462
                 URL: https://issues.apache.org/jira/browse/YARN-10462
             Project: Hadoop YARN
          Issue Type: Improvement
          Components: nodemanager
    Affects Versions: 3.1.0
            Reporter: dzcxzl


When stopping NM or decommission NM, stopping all containers, the waiting time is composed of three values sleep-delay-before-sigkill+process-kill-wait+SHUTDOWN_CLEANUP_SLOP_MS (constant 1000)

yarn.nodemanager.sleep-delay-before-sigkill.ms=250
yarn.nodemanager.process-kill-wait.ms=5000
SHUTDOWN_CLEANUP_SLOP_MS=1000

The parameters of sleep-delay-before-sigkill and process-kill-wait are the time to kill a container/process. When there are too many container lists to be killed, it is usually not completely killed.

We can make SHUTDOWN_CLEANUP_SLOP_MS a configurable parameter, so that in some scenarios, we can wait as long as possible to kill all containers to complete.

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org