Posted to issues@hbase.apache.org by "Nick Dimiduk (Jira)" <ji...@apache.org> on 2020/08/10 23:00:00 UTC

[jira] [Updated] (HBASE-24015) Coverage for Assign and Unassign of Regions on RegionServer on failure

     [ https://issues.apache.org/jira/browse/HBASE-24015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nick Dimiduk updated HBASE-24015:
---------------------------------
    Fix Version/s:     (was: 2.3.1)
                   2.3.2

> Coverage for Assign and Unassign of Regions on RegionServer on failure
> ----------------------------------------------------------------------
>
>                 Key: HBASE-24015
>                 URL: https://issues.apache.org/jira/browse/HBASE-24015
>             Project: HBase
>          Issue Type: Test
>          Components: amv2
>            Reporter: Michael Stack
>            Assignee: Sandeep Pal
>            Priority: Major
>             Fix For: 3.0.0-alpha-1, 1.7.0, 2.4.0, 2.3.2
>
>
> Looking at 'HBASE-23984 [Flakey Tests] TestMasterAbortAndRSGotKilled fails in teardown', and at UnassignRegionHandler, AssignRegionHandler, CloseRegionHandler, and the work that is done inline w/ the request vs. that which is done to the side in executors, we need more coverage and specification of what happens around the edges. The point of this coverage is to find holes in our current handling in a unit test context before we see them out on clusters.
> HBASE-23984 addresses holes where UnassignRegionHandler and AssignRegionHandler could skip out w/o clearing Regions from the RegionServer#regionsInTransitionInRS Map of Regions In Transition if an open or close failed because the RegionServer was aborting.
> Other holes seem to be lurking. On exception, we were leaving entries in the RegionServer#submittedRegionProcedure map added by HBASE-2204; not the end of the world, but they should be cleared on error? HBASE-23984 adds clearing from submittedRegionProcedure, but then procedures, even if failed, get added to the cache of procedures... so if we try to run the procedure again against this server it won't be scheduled.
> Interesting stuff.
> This issue is about adding tests that fail assign/unassign/close on the RegionServer side, making sure RS state is left in a good condition on failure.
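
A rough illustration of the kind of check described above, as a minimal standalone sketch and not HBase code: the class RegionInTransitionSketch, its assign method, and the String-keyed map are all hypothetical stand-ins, loosely modeled on the idea of RegionServer#regionsInTransitionInRS. It forces an open to fail and then checks that no in-transition entry is left behind.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

/** Hypothetical stand-in for a RegionServer-side handler; not an HBase class. */
public class RegionInTransitionSketch {
  // Assumption: region name -> TRUE while an open is in flight.
  // This only mimics the role of regionsInTransitionInRS; the real type may differ.
  static final ConcurrentMap<String, Boolean> regionsInTransition =
      new ConcurrentHashMap<>();

  /**
   * Marks the region as opening, runs the supplied open action, and clears
   * the marker whether the action succeeds or throws.
   */
  static boolean assign(String region, Runnable open) {
    regionsInTransition.put(region, Boolean.TRUE);
    try {
      open.run();
      return true;
    } catch (RuntimeException e) {
      // The failure path is the one the issue wants covered.
      return false;
    } finally {
      // Remove only our own marker so a concurrent close is not clobbered.
      regionsInTransition.remove(region, Boolean.TRUE);
    }
  }

  public static void main(String[] args) {
    // Simulate an open that fails, e.g. because the server is aborting.
    boolean ok = assign("region-a", () -> {
      throw new IllegalStateException("simulated abort");
    });
    if (ok || !regionsInTransition.isEmpty()) {
      throw new AssertionError("in-transition state leaked after failed assign");
    }
  }
}
```

A real test would drive the actual handlers against a mini cluster or mocked RegionServer; the sketch only shows the shape of the assertion (fail the operation, then inspect the bookkeeping maps).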



--
This message was sent by Atlassian Jira
(v8.3.4#803005)