You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user-zh@flink.apache.org by yidan zhao <hi...@gmail.com> on 2021/11/29 06:54:25 UTC

no more slot 这种错误是啥情况,slot肯定足够。

如题, slot足够情况下,提交任务后报错然后不断重启中。

org.apache.flink.util.FlinkException: TaskExecutor akka.tcp://flink@10.146.
62.34:57289/user/rpc/taskmanager_0 has no more allocated slots for job
d5152102d0ff19d8cbcff4490c431b7f.
    at org.apache.flink.runtime.taskexecutor.TaskExecutor
.closeJobManagerConnectionIfNoAllocatedResources(TaskExecutor.java:1948)
    at org.apache.flink.runtime.taskexecutor.TaskExecutor.freeSlotInternal(
TaskExecutor.java:1929)
    at org.apache.flink.runtime.taskexecutor.TaskExecutor.timeoutSlot(
TaskExecutor.java:1962)
    at org.apache.flink.runtime.taskexecutor.TaskExecutor.access$3000(
TaskExecutor.java:182)
    at org.apache.flink.runtime.taskexecutor.TaskExecutor$SlotActionsImpl
.lambda$timeoutSlot$1(TaskExecutor.java:2320)
    at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRunAsync(
AkkaRpcActor.java:440)
    at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(
AkkaRpcActor.java:208)
    at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(
AkkaRpcActor.java:158)
    at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26)
    at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21)
    at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123)
    at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21)
    at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170)
    at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
    at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
    at akka.actor.Actor$class.aroundReceive(Actor.scala:517)
    at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225)
    at akka.actor.ActorCell.receiveMessage(ActorCell.scala:592)
    at akka.actor.ActorCell.invoke(ActorCell.scala:561)
    at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258)
    at akka.dispatch.Mailbox.run(Mailbox.scala:225)
    at akka.dispatch.Mailbox.exec(Mailbox.scala:235)
    at akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
    at akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool
.java:1339)
    at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
    at akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread
.java:107)

Re: no more slot 这种错误是啥情况,slot肯定足够。

Posted by yidan zhao <hi...@gmail.com>.
可能和基于保存点恢复有关? 我去除保存点,直接重启就一次OK了。之前基于保存点重启试了5次都一样的问题。
任务是SQL任务,kafka2hive的任务,带compact算子。

yidan zhao <hi...@gmail.com> 于2021年11月29日周一 下午2:54写道:

> 如题, slot足够情况下,提交任务后报错然后不断重启中。
>
> org.apache.flink.util.FlinkException: TaskExecutor akka.tcp://flink@10.146
> .62.34:57289/user/rpc/taskmanager_0 has no more allocated slots for job
> d5152102d0ff19d8cbcff4490c431b7f.
>     at org.apache.flink.runtime.taskexecutor.TaskExecutor
> .closeJobManagerConnectionIfNoAllocatedResources(TaskExecutor.java:1948)
>     at org.apache.flink.runtime.taskexecutor.TaskExecutor
> .freeSlotInternal(TaskExecutor.java:1929)
>     at org.apache.flink.runtime.taskexecutor.TaskExecutor.timeoutSlot(
> TaskExecutor.java:1962)
>     at org.apache.flink.runtime.taskexecutor.TaskExecutor.access$3000(
> TaskExecutor.java:182)
>     at org.apache.flink.runtime.taskexecutor.TaskExecutor$SlotActionsImpl
> .lambda$timeoutSlot$1(TaskExecutor.java:2320)
>     at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRunAsync(
> AkkaRpcActor.java:440)
>     at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleRpcMessage(
> AkkaRpcActor.java:208)
>     at org.apache.flink.runtime.rpc.akka.AkkaRpcActor.handleMessage(
> AkkaRpcActor.java:158)
>     at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:26)
>     at akka.japi.pf.UnitCaseStatement.apply(CaseStatements.scala:21)
>     at scala.PartialFunction$class.applyOrElse(PartialFunction.scala:123)
>     at akka.japi.pf.UnitCaseStatement.applyOrElse(CaseStatements.scala:21)
>     at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:170)
>     at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
>     at scala.PartialFunction$OrElse.applyOrElse(PartialFunction.scala:171)
>     at akka.actor.Actor$class.aroundReceive(Actor.scala:517)
>     at akka.actor.AbstractActor.aroundReceive(AbstractActor.scala:225)
>     at akka.actor.ActorCell.receiveMessage(ActorCell.scala:592)
>     at akka.actor.ActorCell.invoke(ActorCell.scala:561)
>     at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:258)
>     at akka.dispatch.Mailbox.run(Mailbox.scala:225)
>     at akka.dispatch.Mailbox.exec(Mailbox.scala:235)
>     at akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
>     at akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool
> .java:1339)
>     at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:
> 1979)
>     at akka.dispatch.forkjoin.ForkJoinWorkerThread.run(
> ForkJoinWorkerThread.java:107)
>
>