Posted to commits@dolphinscheduler.apache.org by GitBox <gi...@apache.org> on 2021/06/02 03:40:15 UTC

[GitHub] [dolphinscheduler] chengshiwen opened a new issue #5578: [Bug][Master] ServerNodeManager WorkerGroupListener capture data change and get data failed

chengshiwen opened a new issue #5578:
URL: https://github.com/apache/dolphinscheduler/issues/5578


   **To Reproduce**
   
   Both master servers log the following message:
   
   ```
   ServerNodeManager WorkerGroupListener capture data change and get data failed
   ```
   
   **Expected behavior**
   
   Bug fixed
   
   **Screenshots**
   
   ![image](https://user-images.githubusercontent.com/4902714/120416055-1e3d3700-c38f-11eb-9c5e-813de3e73c65.png)
   ![image](https://user-images.githubusercontent.com/4902714/120416065-2301eb00-c38f-11eb-90ab-46117fdeaa09.png)
   ![image](https://user-images.githubusercontent.com/4902714/120416083-2a28f900-c38f-11eb-87be-151001706bc4.png)
   ![image](https://user-images.githubusercontent.com/4902714/120416097-2e551680-c38f-11eb-9d73-fce3736d0d8a.png)
   ![image](https://user-images.githubusercontent.com/4902714/120416099-30b77080-c38f-11eb-8c19-b71894ca7b1e.png)
   
   
   
   **Which version of Dolphin Scheduler:**
    - [1.3.6]
    - [dev]
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [dolphinscheduler] ruanwenjun edited a comment on issue #5578: [Bug][Master] ServerNodeManager WorkerGroupListener capture data change and get data failed

Posted by GitBox <gi...@apache.org>.
ruanwenjun edited a comment on issue #5578:
URL: https://github.com/apache/dolphinscheduler/issues/5578#issuecomment-852743216


   This does happen if the two masters crash at the same time (with only one master, it is easier to reproduce).
   The cause is that we currently have multiple listeners that listen for node changes.
   https://github.com/apache/dolphinscheduler/blob/cc9e5d5d34fcf2279b267cca7df37a9e80eeba07/dolphinscheduler-server/src/main/java/org/apache/dolphinscheduler/server/master/zk/ZKMasterClient.java#L217-L228
   If this listener executes first, it removes the node, which causes the Master to stop and to execute the QuartzExecutors.shutdown method, closing the datasource. Any listener that runs afterwards and still needs the datasource then fails to get its data.
   
   The root cause is that the listeners have no defined execution order.
   
   Related issue: #5491. Related PR: #5497.
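
The ordering problem described above can be illustrated with a small, self-contained sketch. Everything in it is hypothetical (`NodeListener`, `FakeDataSource`, `simulate`, the node path, and the log strings are invented for illustration and are not DolphinScheduler or ZooKeeper APIs), and it is not meant to show how the issue was actually fixed. It only shows that when a listener that closes the datasource runs before a listener that reads from it, the read fails, and that giving listeners an explicit order avoids this.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical sketch: none of these names come from DolphinScheduler.
class OrderedListenerDemo {

    /** A node-change listener with an explicit priority (lower runs first). */
    interface NodeListener {
        int order();
        void onNodeRemoved(String path);
    }

    /** Stand-in for a datasource that rejects queries once it is closed. */
    static class FakeDataSource {
        private boolean closed = false;

        void close() {
            closed = true;
        }

        String query() {
            if (closed) {
                throw new IllegalStateException("datasource closed");
            }
            return "worker-groups";
        }
    }

    /** Fires one node-removal event and returns what each listener did. */
    static List<String> simulate(boolean sortByOrder) {
        final FakeDataSource ds = new FakeDataSource();
        final List<String> log = new ArrayList<>();

        // Listener that stops the master and closes the datasource.
        NodeListener shutdown = new NodeListener() {
            public int order() { return 1; }
            public void onNodeRemoved(String path) {
                ds.close();
                log.add("shutdown: datasource closed");
            }
        };

        // Listener that needs the datasource to refresh worker groups.
        NodeListener workerGroup = new NodeListener() {
            public int order() { return 0; }
            public void onNodeRemoved(String path) {
                try {
                    log.add("workerGroup: read " + ds.query());
                } catch (IllegalStateException e) {
                    log.add("workerGroup: get data failed");
                }
            }
        };

        // Registered in the problematic order: shutdown first.
        List<NodeListener> listeners =
                new ArrayList<>(Arrays.asList(shutdown, workerGroup));
        if (sortByOrder) {
            listeners.sort(Comparator.comparingInt(NodeListener::order));
        }
        for (NodeListener listener : listeners) {
            listener.onNodeRemoved("/hypothetical/master/node");
        }
        return log;
    }

    public static void main(String[] args) {
        System.out.println("unordered: " + simulate(false));
        System.out.println("ordered:   " + simulate(true));
    }
}
```

In the unordered run the shutdown listener fires first and the worker-group listener records a failure; in the ordered run the worker-group listener reads its data before the datasource is closed.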





[GitHub] [dolphinscheduler] CalvinKirs commented on issue #5578: [Bug][Master] ServerNodeManager WorkerGroupListener capture data change and get data failed

Posted by GitBox <gi...@apache.org>.
CalvinKirs commented on issue #5578:
URL: https://github.com/apache/dolphinscheduler/issues/5578#issuecomment-890685359


   Closed by #5919.





[GitHub] [dolphinscheduler] CalvinKirs closed issue #5578: [Bug][Master] ServerNodeManager WorkerGroupListener capture data change and get data failed

Posted by GitBox <gi...@apache.org>.
CalvinKirs closed issue #5578:
URL: https://github.com/apache/dolphinscheduler/issues/5578


   





[GitHub] [dolphinscheduler] github-actions[bot] commented on issue #5578: [Bug][Master] ServerNodeManager WorkerGroupListener capture data change and get data failed

Posted by GitBox <gi...@apache.org>.
github-actions[bot] commented on issue #5578:
URL: https://github.com/apache/dolphinscheduler/issues/5578#issuecomment-852699342


   Hi:
   * Thank you for your feedback. We have received your issue; please wait patiently for a reply.
   * So that we can understand your request as soon as possible, please provide detailed information, the version, or pictures.
   * If you haven't received a reply for a long time, you can subscribe to the developer mailing list (subscription steps: https://dolphinscheduler.apache.org/zh-cn/community/development/subscribe.html ), then write the issue URL in an email and send your question to dev@dolphinscheduler.apache.org.




