You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Chun-Hung Hsiao (JIRA)" <ji...@apache.org> on 2019/03/07 20:12:00 UTC
[jira] [Created] (MESOS-9639) Make CSI plugin RPC metrics agnostic
to CSI versions.
Chun-Hung Hsiao created MESOS-9639:
--------------------------------------
Summary: Make CSI plugin RPC metrics agnostic to CSI versions.
Key: MESOS-9639
URL: https://issues.apache.org/jira/browse/MESOS-9639
Project: Mesos
Issue Type: Task
Components: storage
Reporter: Chun-Hung Hsiao
Assignee: Chun-Hung Hsiao
Currently SLRP provides per-CSI-call metrics, e.g.:
{noformat}
resource_providers/<rp_type>.<rp_name>/csi_plugin/rpcs/csi.v0.controller.CreateVolume/successes
resource_providers/<rp_type>.<rp_name>/csi_plugin/rpcs/csi.v0.node.NodeGetId/errors
{noformat}
If we are to continue to provide such fine-grained metrics, when operators upgrade their CSI plugins to CSI v1, then SLRP would report another set of metrics for v1, which would be inconvenient to operators.
Also the fine-grained metrics are not very useful for operators, as most information are highly correlated to per-operation metrics. So most likely operators would simply aggregate the per-CSI-call metrics for monitoring CSI plugins, and use per-operation metrics to monitor volume creation/destroy/etc.
So instead of provide such fine-grained metrics, we could just provide a set of aggregated rpc metrics that are agnostic to CSI versions, such as:
{noformat}
resource_providers/<rp_type>.<rp_name>/csi_plugin/rpcs_pending
resource_providers/<rp_type>.<rp_name>/csi_plugin/rpcs_finished
resource_providers/<rp_type>.<rp_name>/csi_plugin/rpcs_failed
resource_providers/<rp_type>.<rp_name>/csi_plugin/rpcs_cancelled
{noformat}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)