You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Ted Yu (JIRA)" <ji...@apache.org> on 2018/02/03 03:44:00 UTC

[jira] [Commented] (HBASE-19925) Delete an unreachable peer will triggers all regionservers abort

    [ https://issues.apache.org/jira/browse/HBASE-19925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16351229#comment-16351229 ] 

Ted Yu commented on HBASE-19925:
--------------------------------

It seems isPeerEnabled() can call getPeer() first.
If the return value is null, don't proceed to calling getStatusOfPeer().

Do you want to provide a patch ?

Thanks

> Delete an unreachable peer will triggers all regionservers abort
> ----------------------------------------------------------------
>
>                 Key: HBASE-19925
>                 URL: https://issues.apache.org/jira/browse/HBASE-19925
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Yun Zhao
>            Priority: Critical
>
> Add an unreachable peer
> {code:java}
> add_peer '4', CLUSTER_KEY => "server1.cie.com:2181:/hbase"{code}
> After a while to delete it,Regionserver will appear in the following log and stop.
> {code:java}
> 2018-02-02 20:04:25,959 INFO [main-EventThread.replicationSource,4] regionserver.ReplicationSource: Replicating 5467de52-dc46-45be-902c-110dd7a83e06 -> null
> 2018-02-02 20:04:25,960 ERROR [main-EventThread.replicationSource,4.replicationSource.xxxx.com%2C16020%2C1515498473547.default,4] regionserver.ReplicationSource: Unexpected exception in ReplicationSourceWorkerThread, currentPath=null
> java.lang.IllegalArgumentException: Peer with id= 4 is not connected
>  at org.apache.hadoop.hbase.replication.ReplicationPeersZKImpl.getStatusOfPeer(ReplicationPeersZKImpl.java:207)
>  at org.apache.hadoop.hbase.replication.regionserver.ReplicationSource.isPeerEnabled(ReplicationSource.java:327)
>  at org.apache.hadoop.hbase.replication.regionserver.ReplicationSource$ReplicationSourceWorkerThread.run(ReplicationSource.java:512)
> 2018-02-02 20:04:25,960 INFO [main-EventThread.replicationSource,4.replicationSource.xxxx.com%2C16020%2C1515498473547.default,4] regionserver.HRegionServer: STOPPED: Unexpected exception in ReplicationSourceWorkerThread{code}
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)