Posted to issues@tez.apache.org by "Hitesh Shah (JIRA)" <ji...@apache.org> on 2014/04/04 23:29:16 UTC
[jira] [Comment Edited] (TEZ-1015) Dag failed with Invalid event: V_ROUTE_EVENT at RECOVERING
[ https://issues.apache.org/jira/browse/TEZ-1015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13960444#comment-13960444 ]
Hitesh Shah edited comment on TEZ-1015 at 4/4/14 9:28 PM:
----------------------------------------------------------
Comments addressed:
- combined into a single event transition
- added more clarification on the >= change for numInited; fixed it to compare == numInited or numInited+1 for the recovery event
- distance from root, as start events may not be sent in the recovery scenario.
Regarding the duplication logic: agreed, we need to figure out a way to re-use transitions. Will file a follow-up JIRA.
was (Author: hitesh):
Comments addressed:
- combined into a single event transition
- added more clarification on >= change for numInited - fixed it to compared == or numInited+1 for the recovery event
- distance from root as start events may not be sent in recovery scenario.
Regd duplication logic - agreed - need to figure out a way to re-use transitions. Will a follow up jira.
> Dag failed with Invalid event: V_ROUTE_EVENT at RECOVERING
> ----------------------------------------------------------
>
> Key: TEZ-1015
> URL: https://issues.apache.org/jira/browse/TEZ-1015
> Project: Apache Tez
> Issue Type: Bug
> Affects Versions: 0.4.0
> Reporter: Arpit Gupta
> Attachments: TEZ-1015.1.patch, TEZ-1015.2.patch
>
>
> During HA tests, a Hive job was run while the NN was killed in the background and the RM was also killed once. The job failed with:
> {code}
> 2014-04-03 08:59:03,253 ERROR [AsyncDispatcher event handler] org.apache.tez.dag.app.dag.impl.VertexImpl: Can't handle Invalid event V_ROUTE_EVENT on vertex Reducer 8 with vertexId vertex_1396515360091_0001_1_03 at current state RECOVERING
> org.apache.hadoop.yarn.state.InvalidStateTransitonException: Invalid event: V_ROUTE_EVENT at RECOVERING
> at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:305)
> at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)
> at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448)
> at org.apache.tez.dag.app.dag.impl.VertexImpl.handle(VertexImpl.java:1210)
> at org.apache.tez.dag.app.dag.impl.VertexImpl.handle(VertexImpl.java:156)
> at org.apache.tez.dag.app.DAGAppMaster$VertexEventDispatcher.handle(DAGAppMaster.java:1659)
> at org.apache.tez.dag.app.DAGAppMaster$VertexEventDispatcher.handle(DAGAppMaster.java:1645)
> at org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:173)
> at org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:106)
> at java.lang.Thread.run(Thread.java:662)
> {code}
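For readers unfamiliar with this class of failure, the trace above is what a table-driven state machine produces when an event arrives in a state that has no transition registered for it. The following is a minimal sketch of that mechanism only. It does not use YARN's StateMachineFactory or reproduce the attached patches: the class TransitionTableSketch, its nested Machine, the exception type, and the RUNNING state and V_RECOVERY_DONE event are all hypothetical names introduced for illustration. Only V_ROUTE_EVENT and RECOVERING come from the log.

```java
import java.util.EnumMap;
import java.util.Map;

public class TransitionTableSketch {
    // RECOVERING and V_ROUTE_EVENT appear in the log; the other names are hypothetical.
    enum State { RECOVERING, RUNNING }
    enum Event { V_ROUTE_EVENT, V_RECOVERY_DONE }

    static class InvalidTransitionException extends RuntimeException {
        InvalidTransitionException(State state, Event event) {
            super("Invalid event: " + event + " at " + state);
        }
    }

    // A tiny table-driven state machine: (current state, event) -> next state.
    static class Machine {
        private final Map<State, Map<Event, State>> table = new EnumMap<>(State.class);
        private State current;

        Machine(State initial) {
            this.current = initial;
        }

        Machine addTransition(State from, Event on, State to) {
            table.computeIfAbsent(from, k -> new EnumMap<>(Event.class)).put(on, to);
            return this;
        }

        // Rejects any event that has no registered transition from the current state.
        State handle(Event event) {
            Map<Event, State> row = table.get(current);
            if (row == null || !row.containsKey(event)) {
                throw new InvalidTransitionException(current, event);
            }
            current = row.get(event);
            return current;
        }
    }

    public static void main(String[] args) {
        Machine machine = new Machine(State.RECOVERING)
                .addTransition(State.RECOVERING, Event.V_RECOVERY_DONE, State.RUNNING);

        try {
            machine.handle(Event.V_ROUTE_EVENT);
        } catch (InvalidTransitionException e) {
            // No (RECOVERING, V_ROUTE_EVENT) entry exists yet, so the event is rejected.
            System.out.println(e.getMessage());
        }

        // Registering a transition for the event in RECOVERING makes it legal.
        // Staying in RECOVERING here is an assumption made for the sketch.
        machine.addTransition(State.RECOVERING, Event.V_ROUTE_EVENT, State.RECOVERING);
        System.out.println(machine.handle(Event.V_ROUTE_EVENT));
    }
}
```

In this sketch the fix is simply a new table entry. What the real vertex should do with a routed event while recovering is a design decision made in the patches and is not modeled here.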