You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Duo Zhang (JIRA)" <ji...@apache.org> on 2018/11/19 09:11:00 UTC

[jira] [Assigned] (HBASE-21377) Add debug log for procedure stack id related operations

     [ https://issues.apache.org/jira/browse/HBASE-21377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Duo Zhang reassigned HBASE-21377:
---------------------------------

    Assignee: Duo Zhang

> Add debug log for procedure stack id related operations
> -------------------------------------------------------
>
>                 Key: HBASE-21377
>                 URL: https://issues.apache.org/jira/browse/HBASE-21377
>             Project: HBase
>          Issue Type: Sub-task
>          Components: proc-v2
>            Reporter: Duo Zhang
>            Assignee: Duo Zhang
>            Priority: Major
>             Fix For: 3.0.0, 2.2.0
>
>         Attachments: HBASE-21377-debuglog.patch
>
>
> TestMergeTableRegionsProcedure is still flakey, and found this in the output
> {noformat}
> 2018-10-24 03:46:12,842 ERROR [Time-limited test] wal.WALProcedureTree(198): Missing stack id 6, max stack id is 8, root procedure is Procedure(pid=42, ppid=-1, class=org.apache.hadoop.hbase.master.assignment.MergeTableRegionsProcedure)
> 2018-10-24 03:46:12,847 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=42, state=WAITING:MERGE_TABLE_REGIONS_CHECK_CLOSED_REGIONS, hasLock=false; MergeTableRegionsProcedure table=testRollbackAndDoubleExecution, regions=[72aed4d14ac73faaa1755e248a55b71a, a848f3ca26989865d5cccc9cd0683ae6], forcibly=false
> 2018-10-24 03:46:12,847 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=43, ppid=42, state=RUNNABLE:REGION_STATE_TRANSITION_CONFIRM_CLOSED, hasLock=false; TransitRegionStateProcedure table=testRollbackAndDoubleExecution, region=72aed4d14ac73faaa1755e248a55b71a, UNASSIGN
> 2018-10-24 03:46:12,848 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=44, ppid=42, state=WAITING:REGION_STATE_TRANSITION_CONFIRM_CLOSED, hasLock=false; TransitRegionStateProcedure table=testRollbackAndDoubleExecution, region=a848f3ca26989865d5cccc9cd0683ae6, UNASSIGN
> 2018-10-24 03:46:12,848 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=45, ppid=43, state=SUCCESS, hasLock=false; org.apache.hadoop.hbase.master.assignment.CloseRegionProcedure
> 2018-10-24 03:46:12,849 ERROR [Time-limited test] procedure2.ProcedureExecutor$2(451): Corrupt pid=46, ppid=44, state=RUNNABLE, hasLock=false; org.apache.hadoop.hbase.master.assignment.CloseRegionProcedure
> {noformat}
> Need to dig more.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)