You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@dolphinscheduler.apache.org by GitBox <gi...@apache.org> on 2020/03/06 06:49:28 UTC

[GitHub] [incubator-dolphinscheduler] nd49782 commented on issue #2065: 随着调度任务增多,ZooKeeper同步缓慢导致Socket超时,从而引起所有Master服务挂掉

nd49782 commented on issue #2065: 随着调度任务增多,ZooKeeper同步缓慢导致Socket超时,从而引起所有Master服务挂掉
URL: https://github.com/apache/incubator-dolphinscheduler/issues/2065#issuecomment-595626543
 
 
   在开发人员佳竹的帮助下,发现zk的一个节点网络延迟非常高,疑似网络不稳定造成的。
   我又调整了DS的如下参数:
   > zkSessionTimeout
   zkConnectionTimeout
   zkRetrySleep
   zkRetryMaxtime
   
   之后监控观察了几天,刚开始几天ZK依旧会报SocketTimeoutException,但是Master至今未出现挂掉的情况。
   在近几天的监控观察中,ZK一切正常,没有报任何异常,之前延迟高的那台节点也恢复正常了。
   
   至于网络延迟的监控情况已联系运维人员。
   后续我会持续观察这个问题,如有进一步进展也会反馈给社区,谢谢!

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services