Posted to issues@hbase.apache.org by "stack (JIRA)" <ji...@apache.org> on 2011/05/05 07:47:03 UTC
[jira] [Commented] (HBASE-3801) Backup Master blocked when the HMaster Node Fail.
[ https://issues.apache.org/jira/browse/HBASE-3801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13029141#comment-13029141 ]
stack commented on HBASE-3801:
------------------------------
I see that we register a listener before we go to wait on becoming master:
{code}
private boolean becomeActiveMaster(MonitoredTask startupStatus)
    throws InterruptedException {
  // TODO: This is wrong!!!! Should have new servername if we restart ourselves,
  // if we come back to life.
  this.activeMasterManager = new ActiveMasterManager(zooKeeper, this.serverName,
      this);
  this.zooKeeper.registerListener(activeMasterManager);
  stallIfBackupMaster(this.conf, this.activeMasterManager);
  return this.activeMasterManager.blockUntilBecomingActiveMaster(startupStatus);
}
{code}
So what is wrong in the above code? Do you fellas have a patch?
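For reference, here is a toy model of why the listener gets registered before the wait. It is only a sketch and not HBase code: ListenerOrderSketch, FakeWatcher, Listener and waitForFailover are made-up names standing in for the ZooKeeper watcher and the master manager, and the "event" is fired synchronously to keep the example deterministic. If the listener is registered only after the node-deleted event has already fired, the waiter never hears about it and just times out.

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

/** Toy model only; none of these types are HBase or ZooKeeper classes. */
public class ListenerOrderSketch {

  /** Hypothetical stand-in for a ZooKeeper listener callback. */
  interface Listener {
    void nodeDeleted();
  }

  /** Hypothetical watcher: it notifies only listeners registered when the event fires. */
  static class FakeWatcher {
    private final List<Listener> listeners = new CopyOnWriteArrayList<>();

    void registerListener(Listener l) {
      listeners.add(l);
    }

    void fireNodeDeleted() {
      for (Listener l : listeners) {
        l.nodeDeleted();
      }
    }
  }

  /** Returns true if the waiter saw the master node go away within the timeout. */
  static boolean waitForFailover(boolean registerBeforeEvent, long timeoutMs) {
    FakeWatcher watcher = new FakeWatcher();
    CountDownLatch masterGone = new CountDownLatch(1);
    Listener listener = masterGone::countDown;

    if (registerBeforeEvent) {
      watcher.registerListener(listener);
    }
    // Simulates the active master's node being deleted.
    watcher.fireNodeDeleted();
    if (!registerBeforeEvent) {
      // Too late: the event above has already been delivered to nobody.
      watcher.registerListener(listener);
    }

    try {
      return masterGone.await(timeoutMs, TimeUnit.MILLISECONDS);
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      return false;
    }
  }

  public static void main(String[] args) {
    System.out.println(waitForFailover(true, 100));  // listener registered first
    System.out.println(waitForFailover(false, 100)); // listener registered after the event
  }
}
```

In this model the first call returns true and the second returns false. Whether anything like the second case is what happens in the reported hang is not something the sketch can tell us.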
Thanks.
> Backup Master blocked when the HMaster Node Fail.
> -------------------------------------------------
>
> Key: HBASE-3801
> URL: https://issues.apache.org/jira/browse/HBASE-3801
> Project: HBase
> Issue Type: Bug
> Components: master
> Affects Versions: 0.90.2
> Environment: 1 HMaster
> 1 HMaster -backup
> 6 HRegionServer
> Reporter: Aaron Guo
>
> When the HMaster crashes, the Backup HMaster blocks while waiting for the ZK notification.
> The Backup HMaster's thread stack is:
> "master-hp1:60000" prio=10 tid=0x00000000484c6800 nid=0x4b56 waiting on condition [0x0000000040209000]
> java.lang.Thread.State: TIMED_WAITING (sleeping)
> at java.lang.Thread.sleep(Native Method)
> at org.apache.hadoop.hbase.master.HMaster.stallIfBackupMaster(HMaster.java:251)
> at org.apache.hadoop.hbase.master.HMaster.run(HMaster.java:279)
> Locked ownable synchronizers:
> - None
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira