You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-commits@hadoop.apache.org by xg...@apache.org on 2016/02/04 05:06:01 UTC

hadoop git commit: YARN-4662. Document some newly added metrics. Contributed by Jian He

Repository: hadoop
Updated Branches:
  refs/heads/trunk c89a14a8a -> 63c63e298


YARN-4662. Document some newly added metrics. Contributed by Jian He


Project: http://git-wip-us.apache.org/repos/asf/hadoop/repo
Commit: http://git-wip-us.apache.org/repos/asf/hadoop/commit/63c63e29
Tree: http://git-wip-us.apache.org/repos/asf/hadoop/tree/63c63e29
Diff: http://git-wip-us.apache.org/repos/asf/hadoop/diff/63c63e29

Branch: refs/heads/trunk
Commit: 63c63e298cf9ff252532297deedde15e77323809
Parents: c89a14a
Author: Xuan <xg...@apache.org>
Authored: Wed Feb 3 20:05:22 2016 -0800
Committer: Xuan <xg...@apache.org>
Committed: Wed Feb 3 20:05:22 2016 -0800

----------------------------------------------------------------------
 .../hadoop-common/src/site/markdown/Metrics.md  | 27 +++++++++++++++++---
 hadoop-yarn-project/CHANGES.txt                 |  2 ++
 2 files changed, 25 insertions(+), 4 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/hadoop/blob/63c63e29/hadoop-common-project/hadoop-common/src/site/markdown/Metrics.md
----------------------------------------------------------------------
diff --git a/hadoop-common-project/hadoop-common/src/site/markdown/Metrics.md b/hadoop-common-project/hadoop-common/src/site/markdown/Metrics.md
index a91bbad..b4ab19a 100644
--- a/hadoop-common-project/hadoop-common/src/site/markdown/Metrics.md
+++ b/hadoop-common-project/hadoop-common/src/site/markdown/Metrics.md
@@ -344,10 +344,16 @@ ClusterMetrics shows the metrics of the YARN cluster from the ResourceManager's
 | Name | Description |
 |:---- |:---- |
 | `NumActiveNMs` | Current number of active NodeManagers |
+| `numDecommissioningNMs` | Current number of NodeManagers being decommissioned|
 | `NumDecommissionedNMs` | Current number of decommissioned NodeManagers |
-| `NumLostNMs` | Current number of lost NodeManagers for not sending heartbeats |
+| `NumShutdownNMs` | Current number of NodeManagers shut down gracefully. Note that this does not count NodeManagers that are forcefully killed. |
+| `NumLostNMs` | Current number of lost NodeManagers for not sending heartbeats. |
 | `NumUnhealthyNMs` | Current number of unhealthy NodeManagers |
 | `NumRebootedNMs` | Current number of rebooted NodeManagers |
+| `AMLaunchDelayNumOps` | Total number of AMs launched |
+| `AMLaunchDelayAvgTime` | Average time in milliseconds RM spends to launch AM containers after the AM container is allocated|
+| `AMRegisterDelayNumOps` | Total number of AMs registered  |
+| `AMRegisterDelayAvgTime` | Average time in milliseconds AM spends to register with RM after the AM container gets launched |
 
 QueueMetrics
 ------------
@@ -372,17 +378,22 @@ In `running_`*num* metrics such as `running_0`, you can set the property `yarn.r
 | `AllocatedVCores` | Current allocated CPU in virtual cores |
 | `AllocatedContainers` | Current number of allocated containers |
 | `AggregateContainersAllocated` | Total number of allocated containers |
+| `aggregateNodeLocalContainersAllocated` | Total number of node local containers allocated  |
+| `aggregateRackLocalContainersAllocated` | Total number of rack local containers allocated  |
+| `aggregateOffSwitchContainersAllocated` | Total number of off switch containers allocated |
 | `AggregateContainersReleased` | Total number of released containers |
 | `AvailableMB` | Current available memory in MB |
 | `AvailableVCores` | Current available CPU in virtual cores |
-| `PendingMB` | Current pending memory resource requests in MB that are not yet fulfilled by the scheduler |
-| `PendingVCores` | Current pending CPU allocation requests in virtual cores that are not yet fulfilled by the scheduler |
-| `PendingContainers` | Current pending resource requests that are not yet fulfilled by the scheduler |
+| `PendingMB` | Current memory requests in MB that are pending to be fulfilled by the scheduler |
+| `PendingVCores` | Current CPU requests in virtual cores that are pending to be fulfilled by the scheduler |
+| `PendingContainers` | Current number of containers that are pending to be fulfilled by the scheduler |
 | `ReservedMB` | Current reserved memory in MB |
 | `ReservedVCores` | Current reserved CPU in virtual cores |
 | `ReservedContainers` | Current number of reserved containers |
 | `ActiveUsers` | Current number of active users |
 | `ActiveApplications` | Current number of active applications |
+| `AppAttemptFirstContainerAllocationDelayNumOps` | Total number of first container allocated for all attempts |
+| `AppAttemptFirstContainerAllocationDelayAvgTime` | Average time RM spends to allocate the first container for all attempts. For managed AM, the first container is AM container. So, this indicates the time duration to allocate AM container. For unmanaged AM, this is the time duration to allocate the first container asked by unmanaged AM. |
 | `FairShareMB` | (FairScheduler only) Current fair share of memory in MB |
 | `FairShareVCores` | (FairScheduler only) Current fair share of CPU in virtual cores |
 | `MinShareMB` | (FairScheduler only) Minimum share of memory in MB |
@@ -406,6 +417,14 @@ NodeManagerMetrics shows the statistics of the containers in the node. Each metr
 | `allocatedContainers` | Current number of allocated containers |
 | `allocatedGB` | Current allocated memory in GB |
 | `availableGB` | Current available memory in GB |
+| `allocatedVcores` | Current used vcores|
+| `availableVcores` | Current available vcores |
+| `containerLaunchDuration` | Average time duration in milliseconds NM takes to launch a container|
+| `badLocalDirs` | Current number of bad local directories. Currently, a disk that cannot be read/written/executed by NM process or A disk being full is considered as bad.|
+| `badLogDirs` | Current number of bad log directories. Currently, a disk that cannot be read/written/executed by NM process or A disk being full is considered as bad. |
+| `goodLocalDirsDiskUtilizationPerc` | Current disk utilization percentage across all good local directories |
+| `goodLogDirsDiskUtilizationPerc` | Current disk utilization percentage across all good log directories |
+
 
 ugi context
 ===========

http://git-wip-us.apache.org/repos/asf/hadoop/blob/63c63e29/hadoop-yarn-project/CHANGES.txt
----------------------------------------------------------------------
diff --git a/hadoop-yarn-project/CHANGES.txt b/hadoop-yarn-project/CHANGES.txt
index 9a8252c..dc27da7 100644
--- a/hadoop-yarn-project/CHANGES.txt
+++ b/hadoop-yarn-project/CHANGES.txt
@@ -788,6 +788,8 @@ Release 2.8.0 - UNRELEASED
     YARN-4100. Add Documentation for Distributed and Delegated-Centralized
     Node Labels feature. (Naganarasimha G R via devaraj)
 
+    YARN-4662. Document some newly added metrics. (Jian He via xgong)
+
   OPTIMIZATIONS
 
     YARN-3339. TestDockerContainerExecutor should pull a single image and not