You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@hbase.apache.org by "stack (JIRA)" <ji...@apache.org> on 2011/07/08 21:57:16 UTC

[jira] [Commented] (HBASE-4058) Extend TestHBaseFsck with a complete .META. recovery scenario

    [ https://issues.apache.org/jira/browse/HBASE-4058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062138#comment-13062138 ] 

stack commented on HBASE-4058:
------------------------------

Here is the thread that prompted this issue: http://search-hadoop.com/m/J27Y72CrGiD/%2522hbck+-fix%2522&subj=hbck+fix

So, one though I had was rebuilding .META. from a scan of .META. with a timestamp behind the catastrophe.  This is not going to be bullet-proof for the case where the .META. storefiles themselves have been damaged or lost.

So, we need a new add_table type fixup.  Wayne in the thread describes it as:

{quote}
Bugs and human error will bring on problems and nothing will
ever change that, but not having tools to help recover out of the hole is
where I think it is lacking...The hbase .META. table
(and -ROOT-?) are the core how HBase manages things. If this gets out of
whack all is lost...Something like a recovery mode that goes through and
sees what is out there and rebuilds the meta based on it. With corrupted
data and lost regions etc. etc. like any relational database there should be
one or more recovery modes that goes through everything and rebuilds it
consistently. Data may be lost but at least the cluster will be left in a
100% consistent/clean state. Manual editing of .META. is not something
anyone should do (especially me). It is prone to human error...it should be
easy to have well tested recover tools that can do the hard work for us.
{quote}



> Extend TestHBaseFsck with a complete .META. recovery scenario
> -------------------------------------------------------------
>
>                 Key: HBASE-4058
>                 URL: https://issues.apache.org/jira/browse/HBASE-4058
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Andrew Purtell
>            Assignee: Andrew Purtell
>             Fix For: 0.92.0
>
>
> We should have a unit test that launches a minicluster and constructs a few tables, then deletes META files on disk, then bounces the master, then recovers the result with HBCK. Perhaps it is possible to extend TestHBaseFsck to do this.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira