You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Jim Kellerman (JIRA)" <ji...@apache.org> on 2008/04/01 20:46:25 UTC
[jira] Updated: (HBASE-507) In master, there are a load of places
where no sleep between retries
[ https://issues.apache.org/jira/browse/HBASE-507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jim Kellerman updated HBASE-507:
--------------------------------
Status: Patch Available (was: Open)
Patch for 0.1 available. Please review.
> In master, there are a load of places where no sleep between retries
> --------------------------------------------------------------------
>
> Key: HBASE-507
> URL: https://issues.apache.org/jira/browse/HBASE-507
> Project: Hadoop HBase
> Issue Type: Bug
> Affects Versions: 0.2.0, 0.1.1
> Reporter: stack
> Assignee: Jim Kellerman
> Fix For: 0.2.0, 0.1.1
>
> Attachments: 507-0.1.patch
>
>
> Here is an example:
> {code}
> 270308 2008-03-12 14:10:02,054 DEBUG org.apache.hadoop.hbase.HMaster: numberOfMetaRegions: 1, onlineMetaRegions.size(): 1
> 270309 2008-03-12 14:10:02,054 DEBUG org.apache.hadoop.hbase.HMaster: process server shutdown scanning .META.,,1 on XX.XX.XX.184:60020 HMaster
> 270310 2008-03-12 14:10:02,056 DEBUG org.apache.hadoop.hbase.HMaster: process server shutdown scanning .META.,,1 on XX.XX.XX.184:60020 HMaster
> 270311 2008-03-12 14:10:02,057 DEBUG org.apache.hadoop.hbase.HMaster: process server shutdown scanning .META.,,1 on XX.XX.XX.184:60020 HMaster
> 270312 2008-03-12 14:10:02,059 DEBUG org.apache.hadoop.hbase.HMaster: process server shutdown scanning .META.,,1 on XX.XX.XX.184:60020 HMaster
> 270313 2008-03-12 14:10:02,060 DEBUG org.apache.hadoop.hbase.HMaster: process server shutdown scanning .META.,,1 on XX.XX.XX.184:60020 HMaster
> 270314 2008-03-12 14:10:02,062 WARN org.apache.hadoop.hbase.HMaster: Processing pending operations: ProcessServerShutdown of XX.XX.XX.180:60020
> 270315 org.apache.hadoop.hbase.NotServingRegionException: org.apache.hadoop.hbase.NotServingRegionException .META.,,1
> 270316 at org.apache.hadoop.hbase.HRegionServer.getRegion(HRegionServer.java:1606)
> ...
> {code}
> Whats actually going on here is 5 retries without a wait in between (logging should include index numbering retry. Seems to be a bunch of duplicated code around retrying that we might be able to fix with a Callable. Jim Firby today suggested we do expotential backoffs in our retries.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.