You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Duo Zhang (JIRA)" <ji...@apache.org> on 2018/11/10 09:49:00 UTC

[jira] [Commented] (HBASE-21377) Missing procedure stack index when restarting

    [ https://issues.apache.org/jira/browse/HBASE-21377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16682302#comment-16682302 ] 

Duo Zhang commented on HBASE-21377:
-----------------------------------

Never seen this for a long time. Plan to close this after the TestMergeTableRegionsProcedure is moved out from the flakey list.

> Missing procedure stack index when restarting
> ---------------------------------------------
>
>                 Key: HBASE-21377
>                 URL: https://issues.apache.org/jira/browse/HBASE-21377
>             Project: HBase
>          Issue Type: Sub-task
>          Components: proc-v2
>            Reporter: Duo Zhang
>            Priority: Major
>             Fix For: 3.0.0, 2.2.0
>
>         Attachments: HBASE-21377-debuglog.patch
>
>
> TestMergeTableRegionsProcedure is still flakey, and found this in the output
> {noformat}
> 2018-10-24 03:46:12,842 ERROR [Time-limited test] wal.WALProcedureTree(198): Missing stack id 6, max stack id is 8, root procedure is Procedure(pid=42, ppid=-1, class=org.apache.hadoop.hbase.master.assignment.MergeTableRegionsProcedure)
> 2018-10-24 03:46:12,847 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=42, state=WAITING:MERGE_TABLE_REGIONS_CHECK_CLOSED_REGIONS, hasLock=false; MergeTableRegionsProcedure table=testRollbackAndDoubleExecution, regions=[72aed4d14ac73faaa1755e248a55b71a, a848f3ca26989865d5cccc9cd0683ae6], forcibly=false
> 2018-10-24 03:46:12,847 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=43, ppid=42, state=RUNNABLE:REGION_STATE_TRANSITION_CONFIRM_CLOSED, hasLock=false; TransitRegionStateProcedure table=testRollbackAndDoubleExecution, region=72aed4d14ac73faaa1755e248a55b71a, UNASSIGN
> 2018-10-24 03:46:12,848 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=44, ppid=42, state=WAITING:REGION_STATE_TRANSITION_CONFIRM_CLOSED, hasLock=false; TransitRegionStateProcedure table=testRollbackAndDoubleExecution, region=a848f3ca26989865d5cccc9cd0683ae6, UNASSIGN
> 2018-10-24 03:46:12,848 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=45, ppid=43, state=SUCCESS, hasLock=false; org.apache.hadoop.hbase.master.assignment.CloseRegionProcedure
> 2018-10-24 03:46:12,849 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=46, ppid=44, state=RUNNABLE, hasLock=false; org.apache.hadoop.hbase.master.assignment.CloseRegionProcedure
> {noformat}
> Need to dig more.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)