Posted to commits@dolphinscheduler.apache.org by GitBox <gi...@apache.org> on 2022/01/18 16:27:42 UTC

[GitHub] [dolphinscheduler] lenboo opened a new issue #8110: [Bug] [Worker/Master] Data duplication caused by worker faking death

lenboo opened a new issue #8110:
URL: https://github.com/apache/dolphinscheduler/issues/8110


   ### Search before asking
   
   - [X] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar issues.
   
   
   ### What happened
   
   1. Because the CPU load was too high, the worker lost its connection to ZooKeeper.
   2. The master failed over the worker and restarted its tasks, but could not kill the original tasks.
   3. As a result, the same task ran twice and the result data was duplicated.
   
   ### What you expected to happen
   
   1. The worker should check available resources before submitting a task to an execution thread.
   2. Can the worker actively reduce CPU load by killing tasks?
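
The resource check proposed in item 1 could look roughly like the following. This is a minimal sketch, not DolphinScheduler code: `can_accept_task`, `try_submit`, the `pending` queue, and the per-core threshold are all hypothetical names and values chosen for illustration.

```python
import os


def can_accept_task(load_avg, cpu_count, max_load_per_core=1.0):
    """Return True when the load average is below the per-core threshold."""
    return load_avg < cpu_count * max_load_per_core


def try_submit(task, pending, load_avg=None, cpu_count=None):
    """Queue the task only if the host has headroom; otherwise reject it.

    Hypothetical helper: `pending` stands in for the worker's execution queue.
    """
    if load_avg is None:
        load_avg = os.getloadavg()[0]  # 1-minute load average (Unix only)
    if cpu_count is None:
        cpu_count = os.cpu_count() or 1
    if not can_accept_task(load_avg, cpu_count):
        return False  # caller can report "busy" so the master dispatches elsewhere
    pending.append(task)
    return True
```

Rejecting the task before it reaches an execution thread keeps an already overloaded worker from taking on more work, which is the situation that preceded the lost ZooKeeper connection described above.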
   
   ### How to reproduce
   
   1. CPU load is too high:
   ![image](https://user-images.githubusercontent.com/29528966/149973563-b8f5c39a-9930-4ab2-8997-8416eac44430.png)
   2. The worker loses its connection to ZooKeeper:
   ![image](https://user-images.githubusercontent.com/29528966/149975735-5e70fd4a-0b01-4e0b-a7bf-452f3f11eefc.png)
   
   
   
   ### Anything else
   
   _No response_
   
   ### Version
   
   dev
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@dolphinscheduler.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [dolphinscheduler] delong1985 commented on issue #8110: [Bug] [Worker/Master] Data duplication caused by worker faking death

Posted by GitBox <gi...@apache.org>.
delong1985 commented on issue #8110:
URL: https://github.com/apache/dolphinscheduler/issues/8110#issuecomment-1021944278


   Hi, I have a similar problem as well.





[GitHub] [dolphinscheduler] github-actions[bot] commented on issue #8110: [Bug] [Worker/Master] Data duplication caused by worker faking death

Posted by GitBox <gi...@apache.org>.
github-actions[bot] commented on issue #8110:
URL: https://github.com/apache/dolphinscheduler/issues/8110#issuecomment-1015582716


   Hi:
   * Thank you for your feedback. We have received your issue; please wait patiently for a reply.
   * To help us understand your request as soon as possible, please provide detailed information, the version, or pictures.
   * If you have not received a reply for a long time, you can subscribe to the developer mailing list (subscription steps: https://dolphinscheduler.apache.org/en-us/community/development/subscribe.html), then send an email containing the issue URL to dev@dolphinscheduler.apache.org.





[GitHub] [dolphinscheduler] JinyLeeChina closed issue #8110: [Bug] [Worker/Master] Data duplication caused by worker faking death

Posted by GitBox <gi...@apache.org>.
JinyLeeChina closed issue #8110:
URL: https://github.com/apache/dolphinscheduler/issues/8110


   





[GitHub] [dolphinscheduler] caishunfeng commented on issue #8110: [Bug] [Worker/Master] Data duplication caused by worker faking death

Posted by GitBox <gi...@apache.org>.
caishunfeng commented on issue #8110:
URL: https://github.com/apache/dolphinscheduler/issues/8110#issuecomment-1038755086


   What is the task type? Is it a YARN-managed task?





[GitHub] [dolphinscheduler] JinyLeeChina commented on issue #8110: [Bug] [Worker/Master] Data duplication caused by worker faking death

Posted by GitBox <gi...@apache.org>.
JinyLeeChina commented on issue #8110:
URL: https://github.com/apache/dolphinscheduler/issues/8110#issuecomment-1047823212


   The YARN task is submitted through a shell task. When the worker's load is high, the shell task has already started but has not yet submitted the application to the YARN cluster. The master then performs failover: when it tries to kill the YARN task, it finds no appId, so it reschedules the task on another worker. Once the load on the original worker drops, its shell task finally submits the application to YARN. At that point the YARN cluster is running two identical applications, and data duplication occurs.
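
One way to narrow this race is to have the delayed shell task confirm, immediately before submitting to YARN, that its attempt is still the current one. The sketch below only illustrates that idea and is not how DolphinScheduler works: `AttemptRegistry` is a hypothetical in-memory stand-in for shared state such as the database or ZooKeeper, and `submit_if_current` is a hypothetical helper.

```python
class AttemptRegistry:
    """Hypothetical shared state recording the current attempt number per task."""

    def __init__(self):
        self._current = {}

    def start_attempt(self, task_id):
        """Called on every (re)dispatch; invalidates any earlier attempt."""
        attempt = self._current.get(task_id, 0) + 1
        self._current[task_id] = attempt
        return attempt

    def is_current(self, task_id, attempt):
        return self._current.get(task_id) == attempt


def submit_if_current(registry, task_id, attempt, yarn_apps):
    """Submit only when this attempt has not been superseded by a failover."""
    if not registry.is_current(task_id, attempt):
        return False  # stale attempt: skip the YARN submission
    yarn_apps.append((task_id, attempt))  # stand-in for the real submission
    return True


registry = AttemptRegistry()
yarn_apps = []

first = registry.start_attempt("task-1")   # dispatched to the overloaded worker
second = registry.start_attempt("task-1")  # master fails over to another worker

submit_if_current(registry, "task-1", second, yarn_apps)  # new worker submits
submit_if_current(registry, "task-1", first, yarn_apps)   # delayed worker is skipped
```

The check and the submission are not atomic here, so this narrows the window rather than closing it; a complete fix would also need the submission itself to be idempotent or fenced on the YARN side.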

