You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Hitesh Shah (JIRA)" <ji...@apache.org> on 2013/06/11 00:08:20 UTC

[jira] [Updated] (TEZ-202) DAGSchedulerMRR does not handle failed vertices properly

     [ https://issues.apache.org/jira/browse/TEZ-202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hitesh Shah updated TEZ-202:
----------------------------

    Summary: DAGSchedulerMRR does not handle failed vertices properly  (was: VertexImpl state machine does not handle a failure situation properly)
    
> DAGSchedulerMRR does not handle failed vertices properly
> --------------------------------------------------------
>
>                 Key: TEZ-202
>                 URL: https://issues.apache.org/jira/browse/TEZ-202
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Hitesh Shah
>              Labels: TEZ-0.2.0
>
> 2013-06-10 04:47:52,629 INFO [AsyncDispatcher event handler] org.apache.tez.dag.app.dag.impl.VertexImpl: Vertex failed as tasks failed. failedTasks:1
> 2013-06-10 04:47:52,657 INFO [AsyncDispatcher event handler] org.apache.tez.dag.app.dag.impl.VertexImpl: vertex_1370823798674_22_1_000001 transitioned from RUNNING to FAILED
> 2013-06-10 04:47:52,657 INFO [AsyncDispatcher event handler] org.apache.tez.dag.app.rm.container.AMContainerImpl: AMContainer container_1370823798674_0022_01_000247 transitioned from RUNNING to STOP_REQUESTED
> 2013-06-10 04:47:52,658 FATAL [AsyncDispatcher event handler] org.apache.tez.dag.app.dag.impl.DAGSchedulerMRR: vertex_1370823798674_22_1_000001 finished. Expecting org.apache.tez.dag.app.dag.impl.VertexImpl@a6a435f to finish.
> 2013-06-10 04:47:52,658 FATAL [AsyncDispatcher event handler] org.apache.hadoop.yarn.event.AsyncDispatcher: Error in dispatcher thread
>         org.apache.tez.dag.api.TezException: vertex_1370823798674_22_1_000001 finished. Expecting org.apache.tez.dag.app.dag.impl.VertexImpl@a6a435f to finish.
>                 at org.apache.tez.dag.app.dag.impl.DAGSchedulerMRR.vertexCompleted(DAGSchedulerMRR.java:56)
>                 at org.apache.tez.dag.app.dag.impl.DAGImpl$VertexCompletedTransition.transition(DAGImpl.java:1138)
>                 at org.apache.tez.dag.app.dag.impl.DAGImpl$VertexCompletedTransition.transition(DAGImpl.java:1107)
>                 at org.apache.hadoop.yarn.state.StateMachineFactory$MultipleInternalArc.doTransition(StateMachineFactory.java:382)
>                 at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:299)
>                 at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:43)
>                 at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:445)
>                 at org.apache.tez.dag.app.dag.impl.DAGImpl.handle(DAGImpl.java:591)
>                 at org.apache.tez.dag.app.dag.impl.DAGImpl.handle(DAGImpl.java:99)
>                 at org.apache.tez.dag.app.DAGAppMaster$DagEventDispatcher.handle(DAGAppMaster.java:922)                at org.apache.tez.dag.app.DAGAppMaster$DagEventDispatcher.handle(DAGAppMaster.java:918)
>                 at org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:130)                at org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:77)
>                 at java.lang.Thread.run(Thread.java:662)        2013-06-10 04:47:52,658 INFO [AsyncDispatcher event handler] org.apache.hadoop.yarn.event.AsyncDispatcher: Exiting, bbye..
>         2013-06-10 04:47:52,659 INFO [Thread-2] org.apache.tez.dag.app.DAGAppMaster: DAGAppMaster received a signal. Signaling TaskScheduler and JobHistoryEventHandler.
>         2013-06-10 04:47:52,659 INFO [Thread-2] org.apache.tez.dag.app.rm.TaskSchedulerEventHandler: RMCommunicator notified that iSignalled was : true
>         2013-06-10 04:47:52,660 INFO [Thread-2] org.apache.tez.dag.history.HistoryEventHandler: Stopping HistoryEventHandler
>         2013-06-10 04:47:52,660 INFO [Thread-2] org.apache.hadoop.yarn.service.AbstractService: Service:org.apache.tez.dag.history.HistoryEventHandler is stopped.
>         2013-06-10 04:47:52,660 ERROR [Thread-45] org.apache.tez.dag.app.rm.TaskSchedulerEventHandler: Returning, interrupted : java.lang.InterruptedException
>         2013-06-10 04:47:52,661 INFO [Thread-2] org.apache.tez.dag.app.rm.TaskSchedulerEventHandler: Setting job diagnostics to Vertex failed vertex_1370823798674_22_1_000001

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira