You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@slider.apache.org by "kyungwan nam (JIRA)" <ji...@apache.org> on 2017/08/25 09:18:00 UTC

[jira] [Updated] (SLIDER-1239) support to set how many times the command script failure tolerates

     [ https://issues.apache.org/jira/browse/SLIDER-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

kyungwan nam updated SLIDER-1239:
---------------------------------
    Attachment: SLIDER-1239.patch

the patch attached. It works well on my cluster.

> support to set how many times the command script failure tolerates
> ------------------------------------------------------------------
>
>                 Key: SLIDER-1239
>                 URL: https://issues.apache.org/jira/browse/SLIDER-1239
>             Project: Slider
>          Issue Type: Improvement
>    Affects Versions: Slider 0.92
>            Reporter: kyungwan nam
>            Assignee: kyungwan nam
>         Attachments: SLIDER-1239.patch
>
>
> if slider-appmaster receives heartbeat including FAILED from slider-agent, it will be retried 3 times.
> retrying makes sense like in situations where starting command is failed by the port which has not been freed.
> but, if the port is set to dynamic assignment, it is rare to fail by occupied port.
> also, a component which workType is BATCH (SLIDER-494) does not need to retry.
> in cases like these, user want to stop the container quickly than retrying.
> Currently, 3 times retrying is hardcoded.
> it would be good if we can set how many times failure tolerates.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)