You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-dev@hadoop.apache.org by "Anubhav Dhoot (JIRA)" <ji...@apache.org> on 2015/02/19 19:46:12 UTC

[jira] [Created] (YARN-3229) Incorrect processing of container as LOST on Interruption during NM shutdown

Anubhav Dhoot created YARN-3229:
-----------------------------------

             Summary: Incorrect processing of container as LOST on Interruption during NM shutdown
                 Key: YARN-3229
                 URL: https://issues.apache.org/jira/browse/YARN-3229
             Project: Hadoop YARN
          Issue Type: Bug
            Reporter: Anubhav Dhoot


YARN-2846 fixed the issue of writing to the state store incorrectly that the process is LOST. But even after that we still process the ContainerExitEvent. If notInterrupted is false in RecoveredContainerLaunch#call we should skip the following
{noformat}
 if (retCode != 0) {
      LOG.warn("Recovered container exited with a non-zero exit code "
          + retCode);
      this.dispatcher.getEventHandler().handle(new ContainerExitEvent(
          containerId,
          ContainerEventType.CONTAINER_EXITED_WITH_FAILURE, retCode,
          "Container exited with a non-zero exit code " + retCode));
      return retCode;
    }
{noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)