You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Andrew Purtell (JIRA)" <ji...@apache.org> on 2010/02/04 05:42:28 UTC
[jira] Updated: (HBASE-1964) Enter temporary "safe mode" to ride
over transient FS layer problems
[ https://issues.apache.org/jira/browse/HBASE-1964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andrew Purtell updated HBASE-1964:
----------------------------------
Affects Version/s: (was: 0.20.1)
Fix Version/s: 0.21.0
Assignee: Andrew Purtell
Summary: Enter temporary "safe mode" to ride over transient FS layer problems (was: Add internal status monitoring to RegionServer)
Refocus this issue as "Enter temporary "safe mode" to ride over transient FS layer problems", as part of ride over restart.
> Enter temporary "safe mode" to ride over transient FS layer problems
> --------------------------------------------------------------------
>
> Key: HBASE-1964
> URL: https://issues.apache.org/jira/browse/HBASE-1964
> Project: Hadoop HBase
> Issue Type: Improvement
> Components: client
> Reporter: elsif
> Assignee: Andrew Purtell
> Fix For: 0.21.0
>
>
> When a hadoop/hbase cluster is under heavy load it will inevitably reach a tipping point where data is lost or corrupted. A
> graceful method is needed to put the cluster into safe mode until more resources can be added or the load on the cluster has been
> reduced.
> St.Ack has suggested the following short-term task: "Meantime, it should be possible to have a cron run a script that checks
> cluster resources from time-to-time -- e.g. how full hdfs is, how much each regionserver is carrying -- and when it determines the needle is in the red,
> flip the cluster to be read-only."
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.