You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Till Rohrmann (JIRA)" <ji...@apache.org> on 2015/02/14 20:14:12 UTC
[jira] [Commented] (FLINK-1546) Failed job causes JobManager to
shutdown due to uncatched WebFrontend exception
[ https://issues.apache.org/jira/browse/FLINK-1546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14321649#comment-14321649 ]
Till Rohrmann commented on FLINK-1546:
--------------------------------------
The problem with the uncaught exception in the actor thread is fixed with 589b539c5acdd25f53ef6c9a453198a960ba93d5.
However, the interesting question is why the system complains that the current job is not in a terminal state. The log line before says that it switched to {{FAILED}}. Can you reproduce the error [~rmetzger]? Maybe we can add in which state the job is when it throws the exception.
> Failed job causes JobManager to shutdown due to uncatched WebFrontend exception
> -------------------------------------------------------------------------------
>
> Key: FLINK-1546
> URL: https://issues.apache.org/jira/browse/FLINK-1546
> Project: Flink
> Issue Type: Bug
> Components: JobManager
> Affects Versions: 0.9
> Reporter: Robert Metzger
>
> {code}
> 16:59:26,588 INFO org.apache.flink.yarn.ApplicationMaster$$anonfun$2$$anon$1 - Status of job ef19b2b201d4b81f031334cb76eadc78 (Basic Page Rank Example) changed to FAILEDCleanup job ef19b2b201d4b81f031334cb76eadc78..
> 16:59:26,591 ERROR akka.actor.OneForOneStrategy - Can only archive the job from a terminal state
> java.lang.IllegalStateException: Can only archive the job from a terminal state
> at org.apache.flink.runtime.executiongraph.ExecutionGraph.prepareForArchiving(ExecutionGraph.java:648)
> at org.apache.flink.runtime.jobmanager.JobManager.org$apache$flink$runtime$jobmanager$JobManager$$removeJob(JobManager.scala:508)
> at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$receiveWithLogMessages$1.applyOrElse(JobManager.scala:271)
> at scala.runtime.AbstractPartialFunction$mcVL$sp.apply$mcVL$sp(AbstractPartialFunction.scala:33)
> at scala.runtime.AbstractPartialFunction$mcVL$sp.apply(AbstractPartialFunction.scala:33)
> at scala.runtime.AbstractPartialFunction$mcVL$sp.apply(AbstractPartialFunction.scala:25)
> at org.apache.flink.yarn.YarnJobManager$$anonfun$receiveYarnMessages$1.applyOrElse(YarnJobManager.scala:70)
> at scala.PartialFunction$OrElse.apply(PartialFunction.scala:162)
> at org.apache.flink.runtime.ActorLogMessages$$anon$1.apply(ActorLogMessages.scala:37)
> at org.apache.flink.runtime.ActorLogMessages$$anon$1.apply(ActorLogMessages.scala:30)
> at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:118)
> at org.apache.flink.runtime.ActorLogMessages$$anon$1.applyOrElse(ActorLogMessages.scala:30)
> at akka.actor.Actor$class.aroundReceive(Actor.scala:465)
> at org.apache.flink.runtime.jobmanager.JobManager.aroundReceive(JobManager.scala:86)
> at akka.actor.ActorCell.receiveMessage(ActorCell.scala:516)
> at akka.actor.ActorCell.invoke(ActorCell.scala:487)
> at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:254)
> at akka.dispatch.Mailbox.run(Mailbox.scala:221)
> at akka.dispatch.Mailbox.exec(Mailbox.scala:231)
> at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
> at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
> at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
> at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> 16:59:26,595 INFO org.apache.flink.yarn.ApplicationMaster$$anonfun$2$$anon$1 - Stopping webserver.
> 16:59:26,654 INFO org.apache.flink.yarn.ApplicationMaster$$anonfun$2$$anon$1 - Stopped webserver.
> 16:59:26,656 INFO org.apache.flink.yarn.ApplicationMaster$$anonfun$2$$anon$1 - Stopping job manager akka://flink/user/jobmanager.
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)