You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "binlijin (JIRA)" <ji...@apache.org> on 2016/10/28 03:49:58 UTC

[jira] [Commented] (HBASE-16960) RegionServer hang when aborting

    [ https://issues.apache.org/jira/browse/HBASE-16960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15614193#comment-15614193 ] 

binlijin commented on HBASE-16960:
----------------------------------

The problem is MemStoreFlusher.1 call FSHLog.sync and do not get a result so not return.
{code}
"MemStoreFlusher.1" prio=10 tid=0x00007f553e0dc800 nid=0x27c91 in Object.wait() [0x00007f5519d73000]
   java.lang.Thread.State: TIMED_WAITING (on object monitor)
	at java.lang.Object.wait(Native Method)
	at org.apache.hadoop.hbase.regionserver.wal.SyncFuture.get(SyncFuture.java:167)
	- locked <0x00007f593d16e128> (a org.apache.hadoop.hbase.regionserver.wal.SyncFuture)
	at org.apache.hadoop.hbase.regionserver.wal.FSHLog.blockOnSync(FSHLog.java:1523)
	at org.apache.hadoop.hbase.regionserver.wal.FSHLog.publishSyncThenBlockOnCompletion(FSHLog.java:1517)
	at org.apache.hadoop.hbase.regionserver.wal.FSHLog.sync(FSHLog.java:1607)
	at org.apache.hadoop.hbase.regionserver.HRegion.internalPrepareFlushCache(HRegion.java:2289)
	at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2110)
	at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2075)
	at org.apache.hadoop.hbase.regionserver.HRegion.flushcache(HRegion.java:1967)
	at org.apache.hadoop.hbase.regionserver.HRegion.flush(HRegion.java:1893)
	at org.apache.hadoop.hbase.regionserver.MemStoreFlusher.flushRegion(MemStoreFlusher.java:510)
	at org.apache.hadoop.hbase.regionserver.MemStoreFlusher.flushRegion(MemStoreFlusher.java:471)
	at org.apache.hadoop.hbase.regionserver.MemStoreFlusher.access$800(MemStoreFlusher.java:75)
	at org.apache.hadoop.hbase.regionserver.MemStoreFlusher$FlushHandler.run(MemStoreFlusher.java:259)
	at java.lang.Thread.run(Thread.java:756)
{code}

> RegionServer hang when aborting
> -------------------------------
>
>                 Key: HBASE-16960
>                 URL: https://issues.apache.org/jira/browse/HBASE-16960
>             Project: HBase
>          Issue Type: Bug
>            Reporter: binlijin
>         Attachments: rs1081.jstack
>
>
> We see regionserver hang when aborting several times and cause all regions on this regionserver out of service and then all affected applications stop works.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)