You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "stack (JIRA)" <ji...@apache.org> on 2011/04/29 02:10:03 UTC
[jira] [Updated] (HBASE-3829) TestMasterFailover failures in
jenkins
[ https://issues.apache.org/jira/browse/HBASE-3829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
stack updated HBASE-3829:
-------------------------
Attachment: 3829.patch
> TestMasterFailover failures in jenkins
> --------------------------------------
>
> Key: HBASE-3829
> URL: https://issues.apache.org/jira/browse/HBASE-3829
> Project: HBase
> Issue Type: Bug
> Reporter: stack
> Assignee: stack
> Attachments: 3829.patch
>
>
> We'll fail the TestMasterFailover tests on occasion up on jenkins. One reason for the 180000 timeouts it that test completed but a regionserver won't go down because its stuck over in getMaster. Looking into it, we have all these loops in the regionserver; we have the main run loop but then there are loops trying to send regionserver reportForDuty and then over in the regionserver report method. In a recent fail up on jenkins we were stuck in one of these outer loops trying to get master.
> This patch removes a bunch of the outer loops instead having the outer loops run around the HRegionServer#run loop.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira