You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@slider.apache.org by "Billie Rinaldi (JIRA)" <ji...@apache.org> on 2017/07/21 15:03:00 UTC

[jira] [Created] (SLIDER-1233) Lost nodes should not contribute to container failures

Billie Rinaldi created SLIDER-1233:
--------------------------------------

             Summary: Lost nodes should not contribute to container failures
                 Key: SLIDER-1233
                 URL: https://issues.apache.org/jira/browse/SLIDER-1233
             Project: Slider
          Issue Type: Bug
          Components: core
            Reporter: Billie Rinaldi
            Assignee: Billie Rinaldi
             Fix For: Slider 1.0.0


If a container completes due to an NM being lost, we should not count this towards container failures that may eventually cause the AM to fail the application. We are already using a ContainerOutcome of Completed (rather than Failed) for this type of container exit, so we just need to change the failure counting in that case. Other failure types associated with Completed are killed by the AM, killed by the RM, and killed after app completion, none of which need to contribute to container failures.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)