You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@accumulo.apache.org by GitBox <gi...@apache.org> on 2019/04/01 18:31:09 UTC

[GitHub] [accumulo] jdwoody opened a new issue #1069: TServer in inconsistent state is never removed

jdwoody opened a new issue #1069: TServer in inconsistent state is never removed
URL: https://github.com/apache/accumulo/issues/1069
 
 
   Recently observed a TServer on a node with hardware issues. The instance was alive, but not functioning. The Master polls all tservers via gatherTableInformation and this server would repeatedly throw a TTransportException on connect. After 3 failures, an attempt is made to halt the server, however, if a TTransportException is thrown, the exception is ignored and the server is assumed to be down.
   [Link to ignored Exception](https://github.com/apache/accumulo/blob/master/server/master/src/main/java/org/apache/accumulo/master/Master.java#L1230)
   
   Propose that the Zookeeper lock be removed by the master on this failure or possibly provide an option for the behavior. `LiveTServerSet.remove` could be an option.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services