You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@dolphinscheduler.apache.org by GitBox <gi...@apache.org> on 2022/09/07 07:31:53 UTC

[GitHub] [dolphinscheduler] lordk911 opened a new issue, #11828: [Feature][worker]Worker fault tolerance process could be config to disable

lordk911 opened a new issue, #11828:
URL: https://github.com/apache/dolphinscheduler/issues/11828

   ### Search before asking
   
   - [X] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar feature requirement.
   
   
   ### Description
   
   I'm using dolphin 2.0.6,  when one worker node was down, the task running on it will be fault tolerance to other worker node,some scenario this is not good , for example, the task is spark-submit on yarn cluster mode , the worker node crash, but the spark task also running on yarn cluster . at this time fault tolerance process will run another spark task.
   
   ### Use case
   
   _No response_
   
   ### Related issues
   
   _No response_
   
   ### Are you willing to submit a PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@dolphinscheduler.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [dolphinscheduler] DarkAssassinator commented on issue #11828: [Feature][worker]Worker fault tolerance process could be config to disable

Posted by GitBox <gi...@apache.org>.
DarkAssassinator commented on issue #11828:
URL: https://github.com/apache/dolphinscheduler/issues/11828#issuecomment-1239099891

   but there is a controversial point. If task failed, but did not start any process, still no need failover?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@dolphinscheduler.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [dolphinscheduler] DarkAssassinator commented on issue #11828: [Feature][worker]Worker fault tolerance process could be config to disable

Posted by GitBox <gi...@apache.org>.
DarkAssassinator commented on issue #11828:
URL: https://github.com/apache/dolphinscheduler/issues/11828#issuecomment-1240775796

   > > but there is a controversial point. If task failed, but did not start any process, still no need failover?
   > 
   > can master node know the task runned on the crashed worker node start any process or not?
   
   May need to add a `TaskExecutionStatus.Process_Running` status in `TaskInstance` entity.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@dolphinscheduler.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [dolphinscheduler] lordk911 commented on issue #11828: [Feature][worker]Worker fault tolerance process could be config to disable

Posted by GitBox <gi...@apache.org>.
lordk911 commented on issue #11828:
URL: https://github.com/apache/dolphinscheduler/issues/11828#issuecomment-1240187578

   > but there is a controversial point. If task failed, but did not start any process, still no need failover?
   
   can master node know the task runned on the crashed worker node start any process or not?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@dolphinscheduler.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [dolphinscheduler] DarkAssassinator commented on issue #11828: [Feature][worker]Worker fault tolerance process could be config to disable

Posted by GitBox <gi...@apache.org>.
DarkAssassinator commented on issue #11828:
URL: https://github.com/apache/dolphinscheduler/issues/11828#issuecomment-1239060026

   +1, i think this is a good idea


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@dolphinscheduler.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [dolphinscheduler] github-actions[bot] commented on issue #11828: [Feature][worker]Worker fault tolerance process could be config to disable

Posted by GitBox <gi...@apache.org>.
github-actions[bot] commented on issue #11828:
URL: https://github.com/apache/dolphinscheduler/issues/11828#issuecomment-1239078722

   Thank you for your feedback, we have received your issue, Please wait patiently for a reply.
   * In order for us to understand your request as soon as possible, please provide detailed information、version or pictures.
   * If you haven't received a reply for a long time, you can [join our slack](https://s.apache.org/dolphinscheduler-slack) and send your question to channel `#troubleshooting`


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@dolphinscheduler.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org