Posted to issues@hbase.apache.org by "Konstantin Ryakhovskiy (Jira)" <ji...@apache.org> on 2022/08/18 22:01:00 UTC

[jira] [Commented] (HBASE-27277) TestRaceBetweenSCPAndTRSP fails in pre commit

    [ https://issues.apache.org/jira/browse/HBASE-27277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17581557#comment-17581557 ] 

Konstantin Ryakhovskiy commented on HBASE-27277:
------------------------------------------------

Cannot reproduce on master after 5 attempts. Sample output from one run:
{code:java}
Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 13.741 s - in org.apache.hadoop.hbase.master.assignment.TestRaceBetweenSCPAndTRSP
{code}

> TestRaceBetweenSCPAndTRSP fails in pre commit
> ---------------------------------------------
>
>                 Key: HBASE-27277
>                 URL: https://issues.apache.org/jira/browse/HBASE-27277
>             Project: HBase
>          Issue Type: Bug
>          Components: proc-v2
>            Reporter: Duo Zhang
>            Priority: Major
>
> It seems the PE worker is stuck here. Needs more digging.
> {noformat}
> "PEWorker-5" daemon prio=5 tid=326 in Object.wait()
> java.lang.Thread.State: WAITING (on object monitor)
>         at java.base@11.0.10/jdk.internal.misc.Unsafe.park(Native Method)
>         at java.base@11.0.10/java.util.concurrent.locks.LockSupport.park(LockSupport.java:194)
>         at java.base@11.0.10/java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:885)
>         at java.base@11.0.10/java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1039)
>         at java.base@11.0.10/java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1345)
>         at java.base@11.0.10/java.util.concurrent.CountDownLatch.await(CountDownLatch.java:232)
>         at app//org.apache.hadoop.hbase.master.assignment.TestRaceBetweenSCPAndTRSP$AssignmentManagerForTest.getRegionsOnServer(TestRaceBetweenSCPAndTRSP.java:97)
>         at app//org.apache.hadoop.hbase.master.procedure.ServerCrashProcedure.getRegionsOnCrashedServer(ServerCrashProcedure.java:288)
>         at app//org.apache.hadoop.hbase.master.procedure.ServerCrashProcedure.executeFromState(ServerCrashProcedure.java:195)
>         at app//org.apache.hadoop.hbase.master.procedure.ServerCrashProcedure.executeFromState(ServerCrashProcedure.java:66)
>         at app//org.apache.hadoop.hbase.procedure2.StateMachineProcedure.execute(StateMachineProcedure.java:188)
>         at app//org.apache.hadoop.hbase.procedure2.Procedure.doExecute(Procedure.java:919)
>         at app//org.apache.hadoop.hbase.procedure2.ProcedureExecutor.execProcedure(ProcedureExecutor.java:1650)
>         at app//org.apache.hadoop.hbase.procedure2.ProcedureExecutor.executeProcedure(ProcedureExecutor.java:1396)
>         at app//org.apache.hadoop.hbase.procedure2.ProcedureExecutor.access$1000(ProcedureExecutor.java:75)
>         at app//org.apache.hadoop.hbase.procedure2.ProcedureExecutor$WorkerThread.runProcedure(ProcedureExecutor.java:1962)
>         at app//org.apache.hadoop.hbase.procedure2.ProcedureExecutor$WorkerThread$$Lambda$477/0x0000000800ac1840.call(Unknown Source)
>         at app//org.apache.hadoop.hbase.trace.TraceUtil.trace(TraceUtil.java:216)
>         at app//org.apache.hadoop.hbase.procedure2.ProcedureExecutor$WorkerThread.run(ProcedureExecutor.java:1989)
> {noformat}
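
The thread dump above shows PEWorker-5 parked in CountDownLatch.await(), reached from AssignmentManagerForTest.getRegionsOnServer in the test. As a minimal sketch of that state only (this is not the HBase test code; the class LatchParkSketch and its methods are hypothetical names made up for illustration), the following standalone Java program parks a thread on a latch, observes it in the WAITING state, and then releases it with countDown():

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

// Hypothetical illustration; none of these names come from HBase.
final class LatchParkSketch {

  // Starts a daemon thread that blocks on the latch until it is counted down.
  static Thread startBlockedWorker(CountDownLatch latch) {
    Thread t = new Thread(() -> {
      try {
        latch.await(); // parks here, like the PEWorker in the dump
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
      }
    }, "SketchWorker");
    t.setDaemon(true);
    t.start();
    return t;
  }

  // Polls for up to about 5 seconds until the thread reaches the expected state.
  static boolean reaches(Thread t, Thread.State expected) throws InterruptedException {
    long deadline = System.nanoTime() + TimeUnit.SECONDS.toNanos(5);
    while (t.getState() != expected && System.nanoTime() < deadline) {
      Thread.sleep(10);
    }
    return t.getState() == expected;
  }

  public static void main(String[] args) throws InterruptedException {
    CountDownLatch latch = new CountDownLatch(1);
    Thread worker = startBlockedWorker(latch);
    System.out.println("parked: " + reaches(worker, Thread.State.WAITING));
    latch.countDown(); // if nothing ever calls this, the worker stays parked
    System.out.println("finished: " + reaches(worker, Thread.State.TERMINATED));
  }
}
```

If nothing ever calls countDown(), the worker stays parked indefinitely, which is consistent with the stuck worker in the dump. Whether that is what actually happens in the failing pre-commit run is not established here.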



--
This message was sent by Atlassian Jira
(v8.20.10#820010)