You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "stack (JIRA)" <ji...@apache.org> on 2009/04/25 00:41:30 UTC

[jira] Updated: (HBASE-773) Service Historian

     [ https://issues.apache.org/jira/browse/HBASE-773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

stack updated HBASE-773:
------------------------

    Fix Version/s:     (was: 0.20.0)

Moving out of 0.20.0. Non-critical. If a patch shows up before release, we'll add it.

> Service Historian
> -----------------
>
>                 Key: HBASE-773
>                 URL: https://issues.apache.org/jira/browse/HBASE-773
>             Project: Hadoop HBase
>          Issue Type: New Feature
>          Components: master
>    Affects Versions: 0.18.0
>            Reporter: Andrew Purtell
>            Priority: Minor
>
> The Region Historian (see HBASE-533) is very useful for debugging issues on the cluster involving region splitting, assignment, etc. It would be additionally useful if the master could keep a separate history of regionservers, when they:
> * start up and report in
> * quiesce/exit when the master tells them to
> * fail (and report error?) and exit
> * are declared dead after their lease expires
> * are assigned a region (some overlap with Region Historian but is a different view)
> * are asked to close a region  (some overlap with Region Historian but is a different view)
> Maybe call it a Service Historian?
> There should be event logs per regionserver identity, available even if a regionserver is offline. The logs can have a simple structure: Timestamp, Event, Description, like the Region Historian tables. 
> Otherwise it is still necessary to comb through logs to determine if a regionserver was flaky during a period of time. 
> Additionally, if regionservers can send an error string when they abort and restart, such that the errors can be viewed in a service history table, that would be really helpful.
> Hyperlinks in the service history table would make it easy to follow a table and its regions over the lifetime of the system, a reconstruction essentially of the client view of the cluster over time. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.