You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Till Rohrmann (Jira)" <ji...@apache.org> on 2020/01/31 11:02:00 UTC
[jira] [Closed] (FLINK-8443) YARNSessionCapacitySchedulerITCase is
flakky
[ https://issues.apache.org/jira/browse/FLINK-8443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Till Rohrmann closed FLINK-8443.
--------------------------------
Resolution: Cannot Reproduce
The problem has not been reported for quite some time. Hence, closing this issue as cannot reproduce.
> YARNSessionCapacitySchedulerITCase is flakky
> --------------------------------------------
>
> Key: FLINK-8443
> URL: https://issues.apache.org/jira/browse/FLINK-8443
> Project: Flink
> Issue Type: Bug
> Components: Deployment / YARN
> Affects Versions: 1.5.0
> Reporter: Piotr Nowojski
> Priority: Major
> Attachments: 35.5.tar.gz
>
>
> Attached build logs from travis.
>
> Test(s) is failing with:
>
> {noformat}
> java.lang.AssertionError: Found a file /home/travis/build/dataArtisans/flink/flink-yarn-tests/target/flink-yarn-tests-capacityscheduler/flink-yarn-tests-capacityscheduler-logDir-nm-1_0/application_1516120275777_0003/container_1516120275
> 777_0003_01_000002/taskmanager.log with a prohibited string (one of [Exception, Started SelectChannelConnector@0.0.0.0:8081]). Excerpts{noformat}
> After downloading the yarn logs uploaded to transfer.sh there is a following failure:
>
> {code:java}
> 2018-01-16 16:32:10,553 INFO org.apache.flink.yarn.YarnTaskManager - Stopping TaskManager with final application status SUCCEEDED and diagnostics: Flink YARN Client requested shutdown
> 2018-01-16 16:32:10,577 INFO org.apache.flink.yarn.YarnTaskManager - Stopping TaskManager akka://flink/user/taskmanager#2122015748.
> 2018-01-16 16:32:10,578 INFO org.apache.flink.yarn.YarnTaskManager - Disassociating from JobManager
> 2018-01-16 16:32:10,588 INFO org.apache.flink.runtime.blob.PermanentBlobCache - Shutting down BLOB cache
> 2018-01-16 16:32:10,599 INFO org.apache.flink.runtime.blob.TransientBlobCache - Shutting down BLOB cache
> 2018-01-16 16:32:10,614 INFO org.apache.flink.runtime.io.disk.iomanager.IOManager - I/O manager removed spill file directory /home/travis/build/dataArtisans/flink/flink-yarn-tests/target/flink-yarn-tests-capacityscheduler/flink-yarn-tests-capacityscheduler-localDir-nm-1_0/usercache/travis/appcache/application_1516120275777_0003/flink-io-356a7c21-a3cd-43cb-926c-7690f861b66c
> 2018-01-16 16:32:10,615 INFO org.apache.flink.runtime.io.network.NetworkEnvironment - Shutting down the network environment and its components.
> 2018-01-16 16:32:10,619 INFO org.apache.flink.runtime.io.network.netty.NettyClient - Successful shutdown (took 4 ms).
> 2018-01-16 16:32:10,623 INFO org.apache.flink.runtime.io.network.netty.NettyServer - Successful shutdown (took 4 ms).
> 2018-01-16 16:32:10,641 INFO org.apache.flink.yarn.YarnTaskManager - Task manager akka://flink/user/taskmanager is completely shut down.
> 2018-01-16 16:32:10,649 INFO akka.remote.RemoteActorRefProvider$RemotingTerminator - Shutting down remote daemon.
> 2018-01-16 16:32:10,650 INFO akka.remote.RemoteActorRefProvider$RemotingTerminator - Remote daemon shut down; proceeding with flushing remote transports.
> 2018-01-16 16:32:10,717 WARN org.apache.flink.shaded.akka.org.jboss.netty.channel.DefaultChannelPipeline - An exception was thrown by an exception handler.
> java.util.concurrent.RejectedExecutionException: Worker has already been shutdown
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.socket.nio.AbstractNioSelector.registerTask(AbstractNioSelector.java:120)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.socket.nio.AbstractNioWorker.executeInIoThread(AbstractNioWorker.java:72)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.socket.nio.NioWorker.executeInIoThread(NioWorker.java:36)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.socket.nio.AbstractNioWorker.executeInIoThread(AbstractNioWorker.java:56)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.socket.nio.NioWorker.executeInIoThread(NioWorker.java:36)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.socket.nio.AbstractNioChannelSink.execute(AbstractNioChannelSink.java:34)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.DefaultChannelPipeline.execute(DefaultChannelPipeline.java:636)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.Channels.fireExceptionCaughtLater(Channels.java:496)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.AbstractChannelSink.exceptionCaught(AbstractChannelSink.java:46)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.DefaultChannelPipeline.notifyHandlerException(DefaultChannelPipeline.java:658)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.DefaultChannelPipeline$DefaultChannelHandlerContext.sendDownstream(DefaultChannelPipeline.java:781)
> at org.apache.flink.shaded.akka.org.jboss.netty.handler.codec.oneone.OneToOneEncoder.handleDownstream(OneToOneEncoder.java:54)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.DefaultChannelPipeline.sendDownstream(DefaultChannelPipeline.java:591)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.DefaultChannelPipeline$DefaultChannelHandlerContext.sendDownstream(DefaultChannelPipeline.java:784)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.SimpleChannelHandler.disconnectRequested(SimpleChannelHandler.java:320)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.SimpleChannelHandler.handleDownstream(SimpleChannelHandler.java:274)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.DefaultChannelPipeline.sendDownstream(DefaultChannelPipeline.java:591)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.DefaultChannelPipeline.sendDownstream(DefaultChannelPipeline.java:582)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.Channels.disconnect(Channels.java:781)
> at org.apache.flink.shaded.akka.org.jboss.netty.channel.AbstractChannel.disconnect(AbstractChannel.java:219)
> at akka.remote.transport.netty.NettyTransport$$anonfun$gracefulClose$1.apply(NettyTransport.scala:241)
> at akka.remote.transport.netty.NettyTransport$$anonfun$gracefulClose$1.apply(NettyTransport.scala:240)
> at scala.util.Success.foreach(Try.scala:236)
> at scala.concurrent.Future$$anonfun$foreach$1.apply(Future.scala:206)
> at scala.concurrent.Future$$anonfun$foreach$1.apply(Future.scala:206)
> at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:36)
> at akka.dispatch.BatchingExecutor$AbstractBatch.processBatch(BatchingExecutor.scala:55)
> at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply$mcV$sp(BatchingExecutor.scala:91)
> at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply(BatchingExecutor.scala:91)
> at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply(BatchingExecutor.scala:91)
> at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:72)
> at akka.dispatch.BatchingExecutor$BlockableBatch.run(BatchingExecutor.scala:90)
> at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:39)
> at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:415)
> at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
> at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
> at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
> at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> 2018-01-16 16:32:10,755 INFO org.apache.flink.yarn.YarnTaskManagerRunner - RECEIVED SIGNAL 15: SIGTERM. Shutting down as requested.}}
> 2018-01-16 16:32:10,762 INFO akka.remote.RemoteActorRefProvider$RemotingTerminator - Remoting shut down.
> 2018-01-16 16:32:10,794 INFO org.apache.flink.yarn.YarnTaskManager - Shutdown completed. Stopping JVM.
> {code}
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)