You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@geode.apache.org by "David Wisler (JIRA)" <ji...@apache.org> on 2016/05/24 16:09:12 UTC

[jira] [Created] (GEODE-1444) Need a gfsh command/tool to analyze customer basic health

David Wisler created GEODE-1444:
-----------------------------------

             Summary: Need a gfsh command/tool to analyze customer basic health
                 Key: GEODE-1444
                 URL: https://issues.apache.org/jira/browse/GEODE-1444
             Project: Geode
          Issue Type: New Feature
          Components: gfsh, statistics
            Reporter: David Wisler


Customers have been increasingly asking for a nice gfsh command that will assess the basic health of their systems to help stave off at the earliest time any issues that might soon impact their clusters.

Such a command would greatly increase customer confidence that the system is indeed operating within healthy parameters.  In addition, it could be used by the Global Support Team to greatly decrease the time spent attempting to assess such health issues as we generally take the first 30 minutes attempting to establish such basic health criteria prior to drilling down to some specific issue.

It is my understanding the most recent Hack Day produced a small prototype of such a command, created by the Lynn's and others.    

Please take this and prioritize this work now that it has some footing.   I believe the benefits of this command would be very evident both externally, becoming a part of customer runbooks, and internally for our teams trying to discover Root Cause of many issues.

If you need some emails from customers for this one, let me know and I will drive that forward.

Such a tool/command could be customized based upon what the customer wants to monitor via use of the command.  This could be configured using properties and/or xml ultimately, or simply use a basic set of 5-10 statistics which can be very effective an early indicators of issues impacting the system.

Can we take this prototype and drive it forward?  I believe the benefit of such a command would increase customer confidence greatly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)