You are viewing a plain text version of this content. The canonical link for it is here.
Posted to hdfs-issues@hadoop.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/07/21 14:32:00 UTC

[jira] [Work logged] (HDFS-16678) RBF supports disable getNodeUsage() in RBFMetrics

     [ https://issues.apache.org/jira/browse/HDFS-16678?focusedWorklogId=793769&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-793769 ]

ASF GitHub Bot logged work on HDFS-16678:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 21/Jul/22 14:31
            Start Date: 21/Jul/22 14:31
    Worklog Time Spent: 10m 
      Work Description: ZanderXu opened a new pull request, #4606:
URL: https://github.com/apache/hadoop/pull/4606

   ### Description of PR
   In our prod environment, we try to collect RBF metrics every 15s through jmx_exporter. And we found that collection task often failed. 
   
   After tracing and found that the collection task is blocked at getNodeUsage() in RBFMetrics, because it will collect all datanode's usage from downstream nameservices.  
   
   This is a very expensive and almost useless operation. Because in most scenarios, each downstream nameserivce contains almost the same DNs. We can get the data usage's from any one nameservices if need, not from RBF.
   
   So I feel that RBF should supports disable getNodeUsage() in RBFMetrics.
   
   




Issue Time Tracking
-------------------

            Worklog Id:     (was: 793769)
    Remaining Estimate: 0h
            Time Spent: 10m

> RBF supports disable getNodeUsage() in RBFMetrics
> -------------------------------------------------
>
>                 Key: HDFS-16678
>                 URL: https://issues.apache.org/jira/browse/HDFS-16678
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: ZanderXu
>            Assignee: ZanderXu
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> In our prod environment, we try to collect RBF metrics every 15s through jmx_exporter. And we found that collection task often failed. 
> After tracing and found that the collection task is blocked at getNodeUsage() in RBFMetrics, because it will collection all datanode's usage from downstream nameservices.  This is a very expensive and almost useless operation. Because in most scenarios, each NameSerivce contains almost the same DNs. We can get the data usage's from any one nameservices, not from RBF.
> So I feel that RBF should supports disable getNodeUsage() in RBFMetrics.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org