You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Daryn Sharp (JIRA)" <ji...@apache.org> on 2013/05/16 22:19:16 UTC

[jira] [Created] (YARN-690) RM exits on token cancel/renew problems

Daryn Sharp created YARN-690:
--------------------------------

             Summary: RM exits on token cancel/renew problems
                 Key: YARN-690
                 URL: https://issues.apache.org/jira/browse/YARN-690
             Project: Hadoop YARN
          Issue Type: Bug
          Components: resourcemanager
    Affects Versions: 0.23.7, 3.0.0, 2.0.5-beta
            Reporter: Daryn Sharp
            Assignee: Daryn Sharp
            Priority: Blocker


The DelegationTokenRenewer thread is critical to the RM.  When a non-IOException occurs, the thread calls System.exit to prevent the RM from running w/o the thread.  It should be exiting only on non-RuntimeExceptions.

The problem is especially bad in 23 because the yarn protobuf layer converts IOExceptions into UndeclaredThrowableExceptions (RuntimeException) which causes the renewer to abort the process.  An UnknownHostException takes down the RM...

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira