You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Till Rohrmann (JIRA)" <ji...@apache.org> on 2018/07/23 22:12:00 UTC

[jira] [Resolved] (FLINK-9838) Slot request failed Exceptions after completing a job

     [ https://issues.apache.org/jira/browse/FLINK-9838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Till Rohrmann resolved FLINK-9838.
----------------------------------
       Resolution: Fixed
    Fix Version/s: 1.6.0
                   1.5.2

Fixed via
master: 4e616a8362744c15a71e0e57ad68fbb52266c837
1.6.0: 37b82ba1ba051f0bda006157e3fee6eccccd3521
1.5.2: e8b70cc2bc47ad6aa52104814e28d6e35e7f8eb9

> Slot request failed Exceptions after completing a job
> -----------------------------------------------------
>
>                 Key: FLINK-9838
>                 URL: https://issues.apache.org/jira/browse/FLINK-9838
>             Project: Flink
>          Issue Type: Bug
>          Components: Distributed Coordination
>    Affects Versions: 1.5.1, 1.6.0
>            Reporter: Nico Kruber
>            Assignee: Till Rohrmann
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.5.2, 1.6.0
>
>
> Currently, after a job finished, e.g. the following one, several exceptions are logged (at INFO level) about failed slot requests although the job has run successfully.
> {code}
> StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
> env.fromElements(1, 2, 3, 4).print();
> env.execute();
> {code}
> {code}
> 16:28:16,106 INFO  org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  - Closing the SlotManager.
> 16:28:16,106 INFO  org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  - Suspending the SlotManager.
> 16:28:16,106 INFO  org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager  - Unregister TaskManager aa20e76adb9aee0cdadc50dbc06ea208 from the SlotManager.
> 16:28:16,107 INFO  org.apache.flink.runtime.resourcemanager.StandaloneResourceManager  - Slot request with allocation id f99ff6d66f7bc618a9ee6e9470e0cdb1 for job 1bdaafd1072e210790790b99e7741b6a failed.
> org.apache.flink.util.FlinkException: The assigned slot b21f8807-5d0a-4e53-9e55-b6522b4a41c0_0 was removed.
> 	at org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager.removeSlot(SlotManager.java:786)
> 	at org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager.removeSlots(SlotManager.java:756)
> 	at org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager.internalUnregisterTaskManager(SlotManager.java:948)
> 	at org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager.unregisterTaskManager(SlotManager.java:372)
> 	at org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager.suspend(SlotManager.java:234)
> 	at org.apache.flink.runtime.resourcemanager.slotmanager.SlotManager.close(SlotManager.java:251)
> 	at org.apache.flink.runtime.resourcemanager.ResourceManager.postStop(ResourceManager.java:224)
> 	at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.postStop(AkkaRpcActor.java:105)
> 	at org.apache.flink.runtime.rpc.akka.FencedAkkaRpcActor.postStop(FencedAkkaRpcActor.java:40)
> 	at akka.actor.Actor$class.aroundPostStop(Actor.scala:515)
> 	at akka.actor.UntypedActor.aroundPostStop(UntypedActor.scala:95)
> 	at akka.actor.dungeon.FaultHandling$class.akka$actor$dungeon$FaultHandling$$finishTerminate(FaultHandling.scala:210)
> 	at akka.actor.dungeon.FaultHandling$class.terminate(FaultHandling.scala:172)
> 	at akka.actor.ActorCell.terminate(ActorCell.scala:374)
> 	at akka.actor.ActorCell.invokeAll$1(ActorCell.scala:467)
> 	at akka.actor.ActorCell.systemInvoke(ActorCell.scala:483)
> 	at akka.dispatch.Mailbox.processAllSystemMessages(Mailbox.scala:282)
> 	at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:260)
> 	at akka.dispatch.Mailbox.run(Mailbox.scala:224)
> 	at akka.dispatch.Mailbox.exec(Mailbox.scala:234)
> 	at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
> 	at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
> 	at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
> 	at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> 16:28:16,109 INFO  org.apache.flink.runtime.taskexecutor.TaskExecutor            - Stopping TaskExecutor akka://flink/user/taskmanager_0.
> 16:28:16,110 INFO  org.apache.flink.runtime.state.TaskExecutorLocalStateStoresManager  - Shutting down TaskExecutorLocalStateStoresManager.
> 16:28:16,109 INFO  org.apache.flink.runtime.dispatcher.StandaloneDispatcher      - Stopping dispatcher akka://flink/user/dispatcher421f3c27-5248-40d4-b219-f0c23480bd6f.
> 16:28:16,111 INFO  org.apache.flink.runtime.dispatcher.StandaloneDispatcher      - Stopping all currently running jobs of dispatcher akka://flink/user/dispatcher421f3c27-5248-40d4-b219-f0c23480bd6f.
> ...
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)