You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Hitesh Shah (JIRA)" <ji...@apache.org> on 2014/04/04 23:29:16 UTC

[jira] [Comment Edited] (TEZ-1015) Dag failed with Invalid event: V_ROUTE_EVENT at RECOVERING

    [ https://issues.apache.org/jira/browse/TEZ-1015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13960444#comment-13960444 ] 

Hitesh Shah edited comment on TEZ-1015 at 4/4/14 9:28 PM:
----------------------------------------------------------

Comments addressed: 

   - combined into a single event transition
   - added more clarification on >= change for numInited - fixed it to compared == or numInited+1 for the recovery event
   - distance from root as start events may not be sent in recovery scenario. 

Regd duplication logic - agreed - need to figure out a way to re-use transitions. Will file a follow up jira.
  


was (Author: hitesh):
Comments addressed: 

   - combined into a single event transition
   - added more clarification on >= change for numInited - fixed it to compared == or numInited+1 for the recovery event
   - distance from root as start events may not be sent in recovery scenario. 

Regd duplication logic - agreed - need to figure out a way to re-use transitions. Will a follow up jira.
  

> Dag failed with Invalid event: V_ROUTE_EVENT at RECOVERING
> ----------------------------------------------------------
>
>                 Key: TEZ-1015
>                 URL: https://issues.apache.org/jira/browse/TEZ-1015
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.4.0
>            Reporter: Arpit Gupta
>         Attachments: TEZ-1015.1.patch, TEZ-1015.2.patch
>
>
> During HA tests hive job was run where NN was killed in the background and RM was also killed once. The job failed with 
> {code}
> 2014-04-03 08:59:03,253 ERROR [AsyncDispatcher event handler] org.apache.tez.dag.app.dag.impl.VertexImpl: Can't handle Invalid event V_ROUTE_EVENT on vertex Reducer 8 with vertexId vertex_1396515360091_0001_1_03 at current state RECOVERING
> org.apache.hadoop.yarn.state.InvalidStateTransitonException: Invalid event: V_ROUTE_EVENT at RECOVERING
> 	at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:305)
> 	at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)
> 	at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448)
> 	at org.apache.tez.dag.app.dag.impl.VertexImpl.handle(VertexImpl.java:1210)
> 	at org.apache.tez.dag.app.dag.impl.VertexImpl.handle(VertexImpl.java:156)
> 	at org.apache.tez.dag.app.DAGAppMaster$VertexEventDispatcher.handle(DAGAppMaster.java:1659)
> 	at org.apache.tez.dag.app.DAGAppMaster$VertexEventDispatcher.handle(DAGAppMaster.java:1645)
> 	at org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:173)
> 	at org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:106)
> 	at java.lang.Thread.run(Thread.java:662)
> {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)