You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Daryn Sharp (JIRA)" <ji...@apache.org> on 2013/03/21 18:51:15 UTC
[jira] [Created] (YARN-494) RM should be able to hard stop a
lingering app on a NM
Daryn Sharp created YARN-494:
--------------------------------
Summary: RM should be able to hard stop a lingering app on a NM
Key: YARN-494
URL: https://issues.apache.org/jira/browse/YARN-494
Project: Hadoop YARN
Issue Type: Bug
Components: nodemanager, resourcemanager
Affects Versions: 2.0.0-alpha, 3.0.0, 0.23.3
Reporter: Daryn Sharp
It's possible for a NM to "leak" applications that the RM believes have finished. This currently tends to happen when a lingering app jams in log aggregation or misses the notification to begin aggregation.
Until aggregation completes, the NMs send app keepalive requests to the RM so it continues renewing the app's tokens. This could be extend to allow the RM to send a hard stop to a NM for an app that has been running for a configurable interval of time after the app has finished.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira