Posted to issues@flink.apache.org by "Till Rohrmann (JIRA)" <ji...@apache.org> on 2015/02/14 20:14:12 UTC

[jira] [Commented] (FLINK-1546) Failed job causes JobManager to shutdown due to uncatched WebFrontend exception

    [ https://issues.apache.org/jira/browse/FLINK-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14321649#comment-14321649 ] 

Till Rohrmann commented on FLINK-1546:
--------------------------------------

The problem with the uncaught exception in the actor thread is fixed in commit 589b539c5acdd25f53ef6c9a453198a960ba93d5.

However, the interesting question is why the system complains that the current job is not in a terminal state, given that the preceding log line says it switched to {{FAILED}}. Can you reproduce the error, [~rmetzger]? It might help to include the job's current state in the exception message when it is thrown.
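
To make the suggestion concrete, here is a minimal, self-contained sketch of a terminal-state check that reports the offending state in its message. It is not the actual {{ExecutionGraph}} code: the class {{ArchiveStateCheck}}, the {{JobStatus}} enum, its values other than {{FAILED}}, and the choice of which states count as terminal are all assumptions made for illustration.

```java
import java.util.EnumSet;
import java.util.Set;

// Hypothetical stand-in for the check in ExecutionGraph.prepareForArchiving;
// none of these names are taken from the Flink code base.
public class ArchiveStateCheck {

    // Assumed set of job states. Only FAILED is confirmed by the log above.
    enum JobStatus { CREATED, RUNNING, FAILING, FAILED, CANCELED, FINISHED }

    // Assumption: these are the states from which archiving is allowed.
    private static final Set<JobStatus> TERMINAL =
            EnumSet.of(JobStatus.FAILED, JobStatus.CANCELED, JobStatus.FINISHED);

    static boolean isTerminal(JobStatus status) {
        return TERMINAL.contains(status);
    }

    // Throws with the current state in the message, so the log shows
    // which state the job was actually in when archiving was attempted.
    static void prepareForArchiving(JobStatus current) {
        if (!isTerminal(current)) {
            throw new IllegalStateException(
                    "Can only archive the job from a terminal state, but the job is in state "
                            + current);
        }
    }

    public static void main(String[] args) {
        prepareForArchiving(JobStatus.FAILED); // terminal state: no exception

        try {
            prepareForArchiving(JobStatus.FAILING); // non-terminal state: rejected
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

With a message like this, the next occurrence of the error would show directly whether the job was still in an intermediate state when the removal message was processed.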

> Failed job causes JobManager to shutdown due to uncatched WebFrontend exception
> -------------------------------------------------------------------------------
>
>                 Key: FLINK-1546
>                 URL: https://issues.apache.org/jira/browse/FLINK-1546
>             Project: Flink
>          Issue Type: Bug
>          Components: JobManager
>    Affects Versions: 0.9
>            Reporter: Robert Metzger
>
> {code}
> 16:59:26,588 INFO  org.apache.flink.yarn.ApplicationMaster$$anonfun$2$$anon$1    - Status of job ef19b2b201d4b81f031334cb76eadc78 (Basic Page Rank Example) changed to FAILEDCleanup job ef19b2b201d4b81f031334cb76eadc78..
> 16:59:26,591 ERROR akka.actor.OneForOneStrategy                                  - Can only archive the job from a terminal state
> java.lang.IllegalStateException: Can only archive the job from a terminal state
> 	at org.apache.flink.runtime.executiongraph.ExecutionGraph.prepareForArchiving(ExecutionGraph.java:648)
> 	at org.apache.flink.runtime.jobmanager.JobManager.org$apache$flink$runtime$jobmanager$JobManager$$removeJob(JobManager.scala:508)
> 	at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$receiveWithLogMessages$1.applyOrElse(JobManager.scala:271)
> 	at scala.runtime.AbstractPartialFunction$mcVL$sp.apply$mcVL$sp(AbstractPartialFunction.scala:33)
> 	at scala.runtime.AbstractPartialFunction$mcVL$sp.apply(AbstractPartialFunction.scala:33)
> 	at scala.runtime.AbstractPartialFunction$mcVL$sp.apply(AbstractPartialFunction.scala:25)
> 	at org.apache.flink.yarn.YarnJobManager$$anonfun$receiveYarnMessages$1.applyOrElse(YarnJobManager.scala:70)
> 	at scala.PartialFunction$OrElse.apply(PartialFunction.scala:162)
> 	at org.apache.flink.runtime.ActorLogMessages$$anon$1.apply(ActorLogMessages.scala:37)
> 	at org.apache.flink.runtime.ActorLogMessages$$anon$1.apply(ActorLogMessages.scala:30)
> 	at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:118)
> 	at org.apache.flink.runtime.ActorLogMessages$$anon$1.applyOrElse(ActorLogMessages.scala:30)
> 	at akka.actor.Actor$class.aroundReceive(Actor.scala:465)
> 	at org.apache.flink.runtime.jobmanager.JobManager.aroundReceive(JobManager.scala:86)
> 	at akka.actor.ActorCell.receiveMessage(ActorCell.scala:516)
> 	at akka.actor.ActorCell.invoke(ActorCell.scala:487)
> 	at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:254)
> 	at akka.dispatch.Mailbox.run(Mailbox.scala:221)
> 	at akka.dispatch.Mailbox.exec(Mailbox.scala:231)
> 	at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
> 	at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
> 	at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
> 	at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> 16:59:26,595 INFO  org.apache.flink.yarn.ApplicationMaster$$anonfun$2$$anon$1    - Stopping webserver.
> 16:59:26,654 INFO  org.apache.flink.yarn.ApplicationMaster$$anonfun$2$$anon$1    - Stopped webserver.
> 16:59:26,656 INFO  org.apache.flink.yarn.ApplicationMaster$$anonfun$2$$anon$1    - Stopping job manager akka://flink/user/jobmanager.
> {code}


