You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Eric Badger (Jira)" <ji...@apache.org> on 2021/03/15 21:05:00 UTC
[jira] [Commented] (YARN-10688) ClusterMetrics should support GPU
capacity related metrics.
[ https://issues.apache.org/jira/browse/YARN-10688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17301987#comment-17301987 ]
Eric Badger commented on YARN-10688:
------------------------------------
[~zhuqi], thanks for the updated patch. To make things a little cleaner, I think we can do something like this instead of having 2 separate methods.
{noformat}
public long getCapabilityGPUs() {
if (capabilityGPUs == null) {
return 0;
}
return capabilityGPUs.value();
}
{noformat}
This works in my non-GPU environment. I think it's cleaner, but need you to test it out in your GPU environment to make sure it works ok. And then of course update the unit tests to use {{getCapabilitiyGPUs}}.
> ClusterMetrics should support GPU capacity related metrics.
> -----------------------------------------------------------
>
> Key: YARN-10688
> URL: https://issues.apache.org/jira/browse/YARN-10688
> Project: Hadoop YARN
> Issue Type: Improvement
> Components: metrics, resourcemanager
> Affects Versions: 3.2.2, 3.4.0
> Reporter: Qi Zhu
> Assignee: Qi Zhu
> Priority: Major
> Attachments: YARN-10688.001.patch, YARN-10688.002.patch, image-2021-03-11-15-35-49-625.png
>
>
> Now the ClusterMetrics only support memory and Vcore related metrics.
>
> {code:java}
> @Metric("Memory Utilization") MutableGaugeLong utilizedMB;
> @Metric("Vcore Utilization") MutableGaugeLong utilizedVirtualCores;
> @Metric("Memory Capability") MutableGaugeLong capabilityMB;
> @Metric("Vcore Capability") MutableGaugeLong capabilityVirtualCores;
> {code}
>
>
> !image-2021-03-11-15-35-49-625.png|width=593,height=253!
> In our cluster, we added GPU supported, so i think the GPU related metrics should also be supported by ClusterMetrics.
>
> cc [~pbacsko] [~Jim_Brennan] [~ebadger] [~gandras]
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: yarn-issues-help@hadoop.apache.org