You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Vinod Kumar Vavilapalli (JIRA)" <ji...@apache.org> on 2013/08/24 22:59:51 UTC

[jira] [Updated] (YARN-1094) RM restart throws Null pointer Exception in Secure Env

     [ https://issues.apache.org/jira/browse/YARN-1094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinod Kumar Vavilapalli updated YARN-1094:
------------------------------------------

    Attachment: YARN-1094-20130824.txt

Here's a patch that fixes this bug.
 - Moved delegationTokenRenewer's start to be explicit and before the state-store starts.
 - Made GetDelegationTokenRequest.newInstance as static. This was a pre-existing bug!
 - Made fixes to consistently use RMDelegationTokenRenewer only in secure mode
 - Some cosmetic changes to call tokenRenewer as more specifically delegationTokenRenewer

TestRMRestart.testDelegationTokenRestoredInDelegationTokenRenewer fails with the same NPE without the code changes and passes with.

Also tested this on a single node secure setup where I first reproduced the NPE easily and verified that RM restart works as expected after the patch.
                
> RM restart throws Null pointer Exception in Secure Env
> ------------------------------------------------------
>
>                 Key: YARN-1094
>                 URL: https://issues.apache.org/jira/browse/YARN-1094
>             Project: Hadoop YARN
>          Issue Type: Bug
>         Environment: secure env
>            Reporter: yeshavora
>            Assignee: Vinod Kumar Vavilapalli
>            Priority: Blocker
>         Attachments: YARN-1094-20130824.txt
>
>
> Enable rmrestart feature And restart Resorce Manager while a job is running.
> Resorce Manager fails to start with below error
> 2013-08-23 17:57:40,705 INFO  resourcemanager.RMAppManager (RMAppManager.java:recover(370)) - Recovering application application_1377280618693_0001
> 2013-08-23 17:57:40,763 ERROR resourcemanager.ResourceManager (ResourceManager.java:serviceStart(617)) - Failed to load/recover state
> java.lang.NullPointerException
>         at org.apache.hadoop.yarn.server.resourcemanager.security.DelegationTokenRenewer.setTimerForTokenRenewal(DelegationTokenRenewer.java:371)
>         at org.apache.hadoop.yarn.server.resourcemanager.security.DelegationTokenRenewer.addApplication(DelegationTokenRenewer.java:307)
>         at org.apache.hadoop.yarn.server.resourcemanager.RMAppManager.submitApplication(RMAppManager.java:291)
>         at org.apache.hadoop.yarn.server.resourcemanager.RMAppManager.recover(RMAppManager.java:371)
>         at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.recover(ResourceManager.java:819)
>         at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.serviceStart(ResourceManager.java:613)
>         at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
>         at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.main(ResourceManager.java:832)
> 2013-08-23 17:57:40,766 INFO  util.ExitUtil (ExitUtil.java:terminate(124)) - Exiting with status 1
>                                                                                                     

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira