Posted to dev@helix.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2014/07/11 19:48:07 UTC
[jira] [Commented] (HELIX-444) add per-participant partition count gauges to helix

    [ https://issues.apache.org/jira/browse/HELIX-444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14059086#comment-14059086 ] 

Hudson commented on HELIX-444:
------------------------------

FAILURE: Integrated in helix #1257 (See [https://builds.apache.org/job/helix/1257/])
[HELIX-444] add per-participant partition count gauges to helix, rb=21419 (kanak: rev 96aef71c899dc1f3956e1211fc1e9a7459a258d1)
* helix-core/src/test/java/org/apache/helix/monitoring/TestClusterStatusMonitorLifecycle.java
* helix-core/src/main/java/org/apache/helix/controller/stages/BestPossibleStateCalcStage.java
* helix-core/src/test/java/org/apache/helix/monitoring/mbeans/TestResourceMonitor.java
* helix-core/src/main/java/org/apache/helix/monitoring/mbeans/ResourceMonitor.java
* helix-core/src/main/java/org/apache/helix/monitoring/mbeans/ClusterStatusMonitor.java
* helix-core/src/main/java/org/apache/helix/monitoring/mbeans/PerInstanceResourceMonitorMBean.java
* helix-core/src/main/java/org/apache/helix/monitoring/mbeans/InstanceMonitor.java
* helix-core/src/main/java/org/apache/helix/monitoring/mbeans/PerInstanceResourceMonitor.java
* helix-core/src/main/java/org/apache/helix/controller/stages/ExternalViewComputeStage.java
* helix-core/src/test/java/org/apache/helix/monitoring/mbeans/TestClusterStatusMonitor.java
* helix-core/src/main/java/org/apache/helix/api/State.java
* helix-core/src/main/java/org/apache/helix/controller/stages/ClusterDataCache.java


> add per-participant partition count gauges to helix
> ---------------------------------------------------
>
>                 Key: HELIX-444
>                 URL: https://issues.apache.org/jira/browse/HELIX-444
>             Project: Apache Helix
>          Issue Type: Improvement
>            Reporter: Zhen Zhang
>            Assignee: Zhen Zhang
>
> We need a way to subtract the partitions known to be down from DifferenceWithIdealState when an instance is offline, so that alerting is reduced to the down-instance notification alone. Without metrics from Helix indicating the number of partitions hosted on a given participant, we can't tell which "DifferenceWithIdealState" counts are expected to be down and which reflect an actual difference caused by something other than a node outage.
> These should be produced on a per-participant, per-resource basis (e.g., helix.i001.participantstatus.cluster.host.db.partitiongauge = 64 or whatever).
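
To make the request above concrete, here is a minimal sketch of how such a gauge could be derived and then used to discount a known outage. It is not the code from the commit: the class PartitionGaugeSketch, both method names, the input shape (partition -> participant -> state for one resource) and the choice to skip OFFLINE and DROPPED replicas are all assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper, not part of Helix: derives a per-participant
// partition count for a single resource from a state mapping.
class PartitionGaugeSketch {

  // stateMap: partition name -> (participant name -> state), for one resource.
  // Assumption of this sketch: OFFLINE and DROPPED replicas are not counted.
  static Map<String, Integer> countByParticipant(Map<String, Map<String, String>> stateMap) {
    Map<String, Integer> counts = new TreeMap<>();
    for (Map<String, String> replicas : stateMap.values()) {
      for (Map.Entry<String, String> replica : replicas.entrySet()) {
        String state = replica.getValue();
        if ("OFFLINE".equals(state) || "DROPPED".equals(state)) {
          continue;
        }
        counts.merge(replica.getKey(), 1, Integer::sum);
      }
    }
    return counts;
  }

  // Portion of a DifferenceWithIdealState reading that is NOT explained by the
  // last gauge value reported for a participant now known to be down.
  static int unexplainedDifference(int differenceWithIdealState, int gaugeOfDownParticipant) {
    return Math.max(0, differenceWithIdealState - gaugeOfDownParticipant);
  }

  public static void main(String[] args) {
    Map<String, Map<String, String>> stateMap = new HashMap<>();

    Map<String, String> p0 = new HashMap<>();
    p0.put("host1", "MASTER");
    p0.put("host2", "SLAVE");
    stateMap.put("db_0", p0);

    Map<String, String> p1 = new HashMap<>();
    p1.put("host1", "SLAVE");
    p1.put("host2", "OFFLINE");
    stateMap.put("db_1", p1);

    // One gauge value per participant for this resource.
    System.out.println(countByParticipant(stateMap));

    // A difference of 70 with a down participant that last hosted 64 partitions.
    System.out.println(unexplainedDifference(70, 64));
  }
}
```

With a gauge like this, an alerting rule could fire on the unexplained remainder rather than on the raw DifferenceWithIdealState value, leaving the down-instance notification as the only alert for a plain node outage.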



--
This message was sent by Atlassian JIRA
(v6.2#6252)