Posted to commits@dolphinscheduler.apache.org by GitBox <gi...@apache.org> on 2022/01/18 16:27:42 UTC
[GitHub] [dolphinscheduler] lenboo opened a new issue #8110: [Bug] [Worker/Master] Data duplication caused by worker faking death
lenboo opened a new issue #8110:
URL: https://github.com/apache/dolphinscheduler/issues/8110
### Search before asking
- [X] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar issues.
### What happened
1. Because the CPU load was too high, the worker lost its connection to ZooKeeper.
2. The master failed over the worker and restarted its tasks, but could not kill the original tasks.
3. As a result, the same task ran twice and the result data was duplicated.
### What you expected to happen
1. The worker should check its resources before submitting a task to an execution thread.
2. Could the worker actively reduce CPU load by killing tasks?
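
The first expectation above could look roughly like the sketch below. It is only an illustration, not DolphinScheduler code: `LoadGuard`, `trySubmit`, and the threshold parameter are hypothetical names, and the only real API used is the JDK's `OperatingSystemMXBean.getSystemLoadAverage()`, which returns a negative value when the load average is unavailable.

```java
import java.lang.management.ManagementFactory;
import java.util.concurrent.Executor;
import java.util.function.DoubleSupplier;

// Hypothetical helper (not part of DolphinScheduler): refuses to hand a task
// to the execution thread pool while the host load is above a threshold.
final class LoadGuard {
    private final DoubleSupplier loadAverage;
    private final double maxLoadAverage;

    LoadGuard(DoubleSupplier loadAverage, double maxLoadAverage) {
        this.loadAverage = loadAverage;
        this.maxLoadAverage = maxLoadAverage;
    }

    // Reads the one-minute system load average from the JVM.
    static LoadGuard forLocalHost(double maxLoadAverage) {
        return new LoadGuard(
            () -> ManagementFactory.getOperatingSystemMXBean().getSystemLoadAverage(),
            maxLoadAverage);
    }

    // Returns true if the task was handed to the executor, false if it was
    // rejected because the host is overloaded. A negative reading means the
    // platform cannot report the load, so the task is accepted.
    boolean trySubmit(Runnable task, Executor executor) {
        double load = loadAverage.getAsDouble();
        if (load >= 0 && load > maxLoadAverage) {
            return false;
        }
        executor.execute(task);
        return true;
    }
}
```

A caller that gets `false` back would leave the task queued or report it as not accepted, so that it can be dispatched elsewhere instead of starting late on an overloaded host.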
### How to reproduce
1. CPU load is too high:
![image](https://user-images.githubusercontent.com/29528966/149973563-b8f5c39a-9930-4ab2-8997-8416eac44430.png)
2. The worker loses its connection to ZooKeeper:
![image](https://user-images.githubusercontent.com/29528966/149975735-5e70fd4a-0b01-4e0b-a7bf-452f3f11eefc.png)
### Anything else
_No response_
### Version
dev
### Are you willing to submit PR?
- [ ] Yes I am willing to submit a PR!
### Code of Conduct
- [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: commits-unsubscribe@dolphinscheduler.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [dolphinscheduler] delong1985 commented on issue #8110: [Bug] [Worker/Master] Data duplication caused by worker faking death
Posted by GitBox <gi...@apache.org>.
delong1985 commented on issue #8110:
URL: https://github.com/apache/dolphinscheduler/issues/8110#issuecomment-1021944278
Hi, I have a similar problem as well.
[GitHub] [dolphinscheduler] github-actions[bot] commented on issue #8110: [Bug] [Worker/Master] Data duplication caused by worker faking death
Posted by GitBox <gi...@apache.org>.
github-actions[bot] commented on issue #8110:
URL: https://github.com/apache/dolphinscheduler/issues/8110#issuecomment-1015582716
Hi:
* Thank you for your feedback. We have received your issue; please wait patiently for a reply.
* So that we can understand your request as soon as possible, please provide detailed information, the version, or pictures.
* If you haven't received a reply for a long time, you can subscribe to the developer mailing list (see https://dolphinscheduler.apache.org/en-us/community/development/subscribe.html for the subscription steps), then include the issue URL in an email and send your question to dev@dolphinscheduler.apache.org.
[GitHub] [dolphinscheduler] JinyLeeChina closed issue #8110: [Bug] [Worker/Master] Data duplication caused by worker faking death
Posted by GitBox <gi...@apache.org>.
JinyLeeChina closed issue #8110:
URL: https://github.com/apache/dolphinscheduler/issues/8110
[GitHub] [dolphinscheduler] caishunfeng commented on issue #8110: [Bug] [Worker/Master] Data duplication caused by worker faking death
Posted by GitBox <gi...@apache.org>.
caishunfeng commented on issue #8110:
URL: https://github.com/apache/dolphinscheduler/issues/8110#issuecomment-1038755086
What's the task type? Is it a YARN-managed task?
[GitHub] [dolphinscheduler] JinyLeeChina commented on issue #8110: [Bug] [Worker/Master] Data duplication caused by worker faking death
Posted by GitBox <gi...@apache.org>.
JinyLeeChina commented on issue #8110:
URL: https://github.com/apache/dolphinscheduler/issues/8110#issuecomment-1047823212
The YARN task is submitted through a shell task. When the worker's load is relatively high, the shell task has already started but has not yet submitted the job to the YARN cluster. The master then performs fault tolerance: it tries to kill the YARN task but finds no application ID, so it schedules the task on another worker. Once the original worker's load drops, its shell finally submits the job to YARN. At that point the YARN cluster is running two identical jobs, and the data is duplicated.
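
One possible way to guard against this sequence is to give every dispatch of a task an attempt number and have the worker re-check it immediately before the shell submits to YARN. The sketch below only illustrates that idea and is not how DolphinScheduler handles failover: `AttemptRegistry` and its methods are hypothetical, and the in-memory map stands in for whatever shared store (database or registry) a real deployment would use.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical, in-memory stand-in for shared state that both the master
// and the workers can read. Not part of DolphinScheduler.
final class AttemptRegistry {
    private final Map<String, Integer> currentAttempt = new ConcurrentHashMap<>();

    // Called whenever a task is dispatched or failed over to another worker.
    // Returns the new attempt number (1 for the first dispatch).
    int nextAttempt(String taskId) {
        return currentAttempt.merge(taskId, 1, Integer::sum);
    }

    // Called by a worker right before it submits the job to YARN. A worker
    // holding an older attempt number must skip the submission.
    boolean isCurrent(String taskId, int attempt) {
        Integer current = currentAttempt.get(taskId);
        return current != null && current == attempt;
    }
}
```

In the scenario described here, the overloaded worker would hold attempt 1, the failover would create attempt 2, and the late submission from the first worker would be skipped because `isCurrent` returns `false` for it. The check narrows the race window but does not close it entirely, since a failover can still happen between the check and the actual submission.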