You are viewing a plain text version of this content. The canonical link for it is here.
Posted to yarn-issues@hadoop.apache.org by "Rohith (JIRA)" <ji...@apache.org> on 2014/07/24 14:37:39 UTC

[jira] [Commented] (YARN-2349) InvalidStateTransitonException after RM switch

    [ https://issues.apache.org/jira/browse/YARN-2349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14073158#comment-14073158 ] 

Rohith commented on YARN-2349:
------------------------------

This is basically configurations in capacity-scheduler.xml of both RM's does not match. During recovery application is moved  New->ACCEPTED synchronously by adding application to scheduler. Before scheduler knows about appilcation,RMAppImpl is moved to ACCEPTED. Any exception(for serveral reason) during submitApplication,APP_REJECTED event is triggered which inturn cause InvaliStateTransition.
For fixing it, either enfource both RM's configuration should be same adding note OR handle APP_REJECTED event at ACCEPTED state.

> InvalidStateTransitonException after RM switch
> ----------------------------------------------
>
>                 Key: YARN-2349
>                 URL: https://issues.apache.org/jira/browse/YARN-2349
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.4.1
>            Reporter: Nishan Shetty
>
> {code}
> 2014-07-23 19:22:28,272 INFO org.apache.hadoop.ipc.Server: IPC Server Responder: starting
> 2014-07-23 19:22:28,273 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 45018: starting
> 2014-07-23 19:22:28,266 ERROR org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl: Can't handle this event at current state
> org.apache.hadoop.yarn.state.InvalidStateTransitonException: Invalid event: APP_REJECTED at ACCEPTED
>  at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:305)
>  at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)
>  at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448)
>  at org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl.handle(RMAppImpl.java:635)
>  at org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppImpl.handle(RMAppImpl.java:83)
>  at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$ApplicationEventDispatcher.handle(ResourceManager.java:706)
>  at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$ApplicationEventDispatcher.handle(ResourceManager.java:690)
>  at org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:173)
>  at org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:106)
>  at java.lang.Thread.run(Thread.java:662)
> 2014-07-23 19:22:28,283 INFO org.mortbay.log: Stopped SelectChannelConnector@10.18.40.84:45020
> 2014-07-23 19:22:28,291 ERROR org.apache.hadoop.yarn.server.applicationhistoryservice.FileSystemApplicationHistoryStore: Error when openning history file of application application_1406116264351_0007
> {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)