Posted to issues@hbase.apache.org by "Ted Yu (JIRA)" <ji...@apache.org> on 2013/08/20 12:33:52 UTC

[jira] [Commented] (HBASE-9254) TestHBaseFsck occasionally hung

    [ https://issues.apache.org/jira/browse/HBASE-9254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13744838#comment-13744838 ] 

Ted Yu commented on HBASE-9254:
-------------------------------

Without test output, I don't have a complete picture of what was going on during testSplitDaughtersNotInMeta().
{code}
"RpcServer.handler=1,port=37014" daemon prio=10 tid=0x746a9c00 nid=0x31f1 waiting for monitor entry [0x70559000]
   java.lang.Thread.State: BLOCKED (on object monitor)
  at org.apache.hadoop.hbase.master.TableNamespaceManager.get(TableNamespaceManager.java:111)
  - waiting to lock <0x7fb83a10> (a org.apache.hadoop.hbase.master.TableNamespaceManager)
  at org.apache.hadoop.hbase.master.HMaster.getNamespaceDescriptor(HMaster.java:3116)
  at org.apache.hadoop.hbase.master.HMaster.createTable(HMaster.java:1779)
  at org.apache.hadoop.hbase.master.HMaster.createTable(HMaster.java:1820)
  at org.apache.hadoop.hbase.protobuf.generated.MasterAdminProtos$MasterAdminService$2.callBlockingMethod(MasterAdminProtos.java:27720)
  at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2156)
  at org.apache.hadoop.hbase.ipc.RpcServer$Handler.run(RpcServer.java:1861)

"RpcServer.handler=0,port=37014" daemon prio=10 tid=0x7464d400 nid=0x31f0 waiting on condition [0x705aa000]
   java.lang.Thread.State: TIMED_WAITING (sleeping)
  at java.lang.Thread.sleep(Native Method)
  at org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:150)
  - locked <0x81c8bc90> (a org.apache.hadoop.hbase.client.RpcRetryingCaller)
  at org.apache.hadoop.hbase.client.HTable.get(HTable.java:732)
  at org.apache.hadoop.hbase.master.TableNamespaceManager.get(TableNamespaceManager.java:111)
  - locked <0x7fb83a10> (a org.apache.hadoop.hbase.master.TableNamespaceManager)
  at org.apache.hadoop.hbase.master.HMaster.getNamespaceDescriptor(HMaster.java:3116)
  at org.apache.hadoop.hbase.master.HMaster.createTable(HMaster.java:1779)
  at org.apache.hadoop.hbase.master.HMaster.createTable(HMaster.java:1820)
  at org.apache.hadoop.hbase.protobuf.generated.MasterAdminProtos$MasterAdminService$2.callBlockingMethod(MasterAdminProtos.java:27720)
  at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2156)
  at org.apache.hadoop.hbase.ipc.RpcServer$Handler.run(RpcServer.java:1861)
{code}
Here is related code in TableNamespaceManager:
{code}
  public synchronized NamespaceDescriptor get(String name) throws IOException {
    Result res = table.get(new Get(Bytes.toBytes(name)));
{code}
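It looks like the retrieval of the NamespaceDescriptor took longer than expected: handler=0 holds the TableNamespaceManager lock (<0x7fb83a10>) while sleeping between retries inside HTable.get(), and handler=1 is blocked waiting for that same lock.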
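To illustrate the pattern in the two handler stacks above, here is a minimal, self-contained sketch. It is not HBase code: SlowManager, the latch names, and the thread names are made up for illustration, and the latch wait only stands in for a slow table.get() call. It shows that while one thread is stuck inside a synchronized method, a second caller of the same method sits in the BLOCKED state.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

public class MonitorContentionDemo {

    // Hypothetical stand-in for a manager with a synchronized lookup method.
    static class SlowManager {
        private final CountDownLatch entered = new CountDownLatch(1);
        private final CountDownLatch release = new CountDownLatch(1);

        // The monitor on this object is held for the whole duration of the slow call.
        public synchronized String get(String name) throws InterruptedException {
            entered.countDown();
            // Stands in for a remote read that is slow or keeps retrying.
            release.await(5, TimeUnit.SECONDS);
            return name;
        }
    }

    private static void call(SlowManager mgr, String name) {
        try {
            mgr.get(name);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }

    /** Returns the state observed for a second caller while the first holds the monitor. */
    public static Thread.State observeSecondCaller() {
        SlowManager mgr = new SlowManager();
        Thread first = new Thread(() -> call(mgr, "ns1"), "handler-0");
        Thread second = new Thread(() -> call(mgr, "ns2"), "handler-1");
        try {
            first.start();
            mgr.entered.await();          // first is now inside get(), holding the monitor
            second.start();

            // Poll until the second thread reaches the monitor (bounded wait).
            long deadline = System.nanoTime() + TimeUnit.SECONDS.toNanos(4);
            Thread.State state = second.getState();
            while (state != Thread.State.BLOCKED && System.nanoTime() < deadline) {
                Thread.sleep(5);
                state = second.getState();
            }

            mgr.release.countDown();      // let the first caller finish
            first.join();
            second.join();
            return state;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("second caller state: " + observeSecondCaller());
    }
}
```

In this sketch the second caller cannot make progress until the first returns, which is the same shape as handler=1 waiting on <0x7fb83a10> in the dump. Whether narrowing the synchronized scope is appropriate for TableNamespaceManager is a separate question that the thread dump alone does not answer.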
                
> TestHBaseFsck occasionally hung
> -------------------------------
>
>                 Key: HBASE-9254
>                 URL: https://issues.apache.org/jira/browse/HBASE-9254
>             Project: HBase
>          Issue Type: Test
>            Reporter: Ted Yu
>
> From https://builds.apache.org/job/hbase-0.95-on-hadoop2/247/console :
> {code}
> "pool-1-thread-1" prio=10 tid=0x73a2a400 nid=0x2f4d in Object.wait() [0x73bdd000]
>    java.lang.Thread.State: TIMED_WAITING (on object monitor)
> 	at java.lang.Object.wait(Native Method)
> 	at org.apache.hadoop.hbase.ipc.RpcClient.call(RpcClient.java:1412)
> 	- locked <0xccdd8898> (a org.apache.hadoop.hbase.ipc.RpcClient$Call)
> 	at org.apache.hadoop.hbase.ipc.RpcClient.callBlockingMethod(RpcClient.java:1630)
> 	at org.apache.hadoop.hbase.ipc.RpcClient$BlockingRpcChannelImplementation.callBlockingMethod(RpcClient.java:1687)
> 	at org.apache.hadoop.hbase.protobuf.generated.MasterAdminProtos$MasterAdminService$BlockingStub.createTable(MasterAdminProtos.java:29365)
> 	at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$5.createTable(HConnectionManager.java:1996)
> 	at org.apache.hadoop.hbase.client.HBaseAdmin$2.call(HBaseAdmin.java:590)
> 	at org.apache.hadoop.hbase.client.HBaseAdmin$2.call(HBaseAdmin.java:586)
> 	at org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:120)
> 	- locked <0x81c8abb0> (a org.apache.hadoop.hbase.client.RpcRetryingCaller)
> 	at org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:98)
> 	- locked <0x81c8abb0> (a org.apache.hadoop.hbase.client.RpcRetryingCaller)
> 	at org.apache.hadoop.hbase.client.HBaseAdmin.executeCallable(HBaseAdmin.java:3087)
> 	at org.apache.hadoop.hbase.client.HBaseAdmin.createTableAsync(HBaseAdmin.java:586)
> 	at org.apache.hadoop.hbase.client.HBaseAdmin.createTable(HBaseAdmin.java:477)
> 	at org.apache.hadoop.hbase.util.TestHBaseFsck.setupTable(TestHBaseFsck.java:338)
> 	at org.apache.hadoop.hbase.util.TestHBaseFsck.testSplitDaughtersNotInMeta(TestHBaseFsck.java:1362)
> ...
>      {color:red}-1 core zombie tests{color}.  There are 1 zombie test(s): 	at org.apache.hadoop.hbase.util.TestHBaseFsck.testSplitDaughtersNotInMeta(TestHBaseFsck.java:1362)'
> {code}
> I looked at https://builds.apache.org/job/hbase-0.95-on-hadoop2/247/artifact/0.95-on-hadoop2/hbase-server/target/surefire-reports/,
> there was no test output.
