You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Ray Chiang (JIRA)" <ji...@apache.org> on 2016/05/12 21:01:12 UTC

[jira] [Created] (YARN-5078) [Umbrella] NodeManager health checker improvements

Ray Chiang created YARN-5078:
--------------------------------

             Summary: [Umbrella] NodeManager health checker improvements
                 Key: YARN-5078
                 URL: https://issues.apache.org/jira/browse/YARN-5078
             Project: Hadoop YARN
          Issue Type: Bug
          Components: nodemanager
            Reporter: Ray Chiang
            Assignee: Ray Chiang


There have been a bunch of NodeManager health checker improvement requests in the past.

Right now, I expect that initially there just need to be a bunch of base functionality added.  The most obvious parts are:

- Finding appropriate measurements of health
- Storing measurements as metrics.  This should allow easy comparison of good nodes and bad nodes.  This should eventually lead to threshold blacklisting/whitelisting.
- Adding metrics to the NodeManager UI

After this basic functionality is added, we can start consider some enhanced form of NodeManager health status conditions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org