Posted to commits@dolphinscheduler.apache.org by GitBox <gi...@apache.org> on 2021/12/24 02:10:07 UTC

[GitHub] [dolphinscheduler] zwZjut commented on issue #7440: [Bug] [WorkerServer] workerServer always restart: judged to death

zwZjut commented on issue #7440:
URL: https://github.com/apache/dolphinscheduler/issues/7440#issuecomment-1000609789


   > The death mechanism works together with the master/worker suicide logic.
   > 
   > When the zk session timeout is too short (default is 60s), the worker or master loses its connection to zk and its `nodes/master/ip:port` node is removed. The other masters/workers watch this event and add the node to `nodes/dead-servers/ip:port`.
   > 
   > So when the master/worker's network recovers and it reconnects to zk, it receives the dead-node add event and kills itself.
   > 
   > Please describe your scenario in more detail, as in the referenced issue: #6880
   
   This was a failover logic bug; it is fixed in 2.0.2.
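
For readers following the quoted explanation, here is a minimal in-memory sketch of the sequence it describes: a session expires, a peer records the server under `nodes/dead-servers`, and the server stops itself on reconnect. `Registry`, `Server`, and all their methods are hypothetical names invented for this illustration. They are not DolphinScheduler or ZooKeeper APIs, and the sketch does not reproduce the actual failover code or the 2.0.2 fix.

```python
# Toy model of the "judged dead" sequence from the quoted comment.
# Registry and Server are hypothetical; nothing here is a real DolphinScheduler API.


class Registry:
    """Stands in for ZooKeeper: a set of paths plus node-removal watchers."""

    def __init__(self):
        self.paths = set()
        self.watchers = []  # callbacks invoked with the removed path

    def create(self, path):
        self.paths.add(path)

    def exists(self, path):
        return path in self.paths

    def expire_session(self, path):
        # A session timeout removes the server's node and notifies all watchers.
        self.paths.discard(path)
        for callback in list(self.watchers):
            callback(path)


class Server:
    def __init__(self, registry, kind, address):
        self.registry = registry
        self.kind = kind          # "master" or "worker"
        self.address = address    # "ip:port"
        self.alive = True
        registry.create(self.node_path)
        registry.watchers.append(self.on_node_removed)

    @property
    def node_path(self):
        return f"nodes/{self.kind}/{self.address}"

    @property
    def dead_path(self):
        return f"nodes/dead-servers/{self.address}"

    def on_node_removed(self, path):
        # A surviving peer records the vanished server as dead.
        if self.alive and path != self.node_path:
            address = path.rsplit("/", 1)[-1]
            self.registry.create(f"nodes/dead-servers/{address}")

    def reconnect(self):
        # After the network recovers, a server that finds its own
        # dead-servers entry stops itself instead of re-registering.
        if self.registry.exists(self.dead_path):
            self.alive = False
        else:
            self.registry.create(self.node_path)
        return self.alive
```

In this sketch, creating a master and a worker on one `Registry`, calling `registry.expire_session(worker.node_path)`, and then calling `worker.reconnect()` leaves the worker with `alive == False`, while the master stays up.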


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@dolphinscheduler.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org