Posted to issues@flink.apache.org by "Robert Metzger (Jira)" <ji...@apache.org> on 2022/07/12 07:22:00 UTC
[jira] [Comment Edited] (FLINK-28423) ExecutionVertexCancelTest crashed with exit code 239
[ https://issues.apache.org/jira/browse/FLINK-28423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17565324#comment-17565324 ]
Robert Metzger edited comment on FLINK-28423 at 7/12/22 7:21 AM:
-----------------------------------------------------------------
This error usually indicates that the {{FatalExitExceptionHandler}} has been called, but I can't see any evidence of that in the logs. The crash seems to be caused by the {{ExecutionVertexCancelTest}}. Here are the last lines of its log:
{code}
================================================================================
Test testCancelFromScheduled(org.apache.flink.runtime.executiongraph.ExecutionVertexCancelTest) is running.
--------------------------------------------------------------------------------
03:36:05,646 [ main] INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - Created execution graph 2d39dc1365af352d59f55034de0a5618 for job f81f60a2092cbe3286458b04211040ea.
03:36:05,646 [ main] INFO org.apache.flink.runtime.scheduler.SchedulerTestingUtils [] - Running initialization on master for job Unnamed job (f81f60a2092cbe3286458b04211040ea).
03:36:05,646 [ main] INFO org.apache.flink.runtime.scheduler.SchedulerTestingUtils [] - Successfully ran initialization on master in 0 ms.
03:36:05,647 [ main] INFO org.apache.flink.runtime.scheduler.adapter.DefaultExecutionTopology [] - Built 1 new pipelined regions in 0 ms, total 1 pipelined regions currently.
03:36:05,647 [ main] INFO org.apache.flink.runtime.scheduler.SchedulerTestingUtils [] - Using failover strategy org.apache.flink.runtime.executiongraph.failover.flip1.RestartPipelinedRegionFailoverStrategy@7a977d23 for Unnamed job (f81f60a2092cbe3286458b04211040ea).
03:36:05,647 [ main] INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - TestVertex (1/1) (2d39dc1365af352d59f55034de0a5618_403847e7f01320ff1624af5085616f1f_0_0) switched from SCHEDULED to CANCELING.
03:36:05,647 [ main] INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - TestVertex (1/1) (2d39dc1365af352d59f55034de0a5618_403847e7f01320ff1624af5085616f1f_0_0) switched from CANCELING to CANCELED.
03:36:05,647 [ main] INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - Discarding the results produced by task execution 2d39dc1365af352d59f55034de0a5618_403847e7f01320ff1624af5085616f1f_0_0.
03:36:05,647 [ main] INFO org.apache.flink.runtime.executiongraph.ExecutionVertexCancelTest [] -
--------------------------------------------------------------------------------
Test testCancelFromScheduled(org.apache.flink.runtime.executiongraph.ExecutionVertexCancelTest) successfully run.
================================================================================
03:36:05,648 [ main] INFO org.apache.flink.runtime.executiongraph.ExecutionVertexCancelTest [] -
================================================================================
Test testRepeatedCancelFromRunning(org.apache.flink.runtime.executiongraph.ExecutionVertexCancelTest) is running.
--------------------------------------------------------------------------------
03:36:05,648 [ main] INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - Created execution graph e43f14294e9e15f2495185182b3cc6e1 for job 7dee3fa6bd5706737478f5c7257fd2fb.
03:36:05,648 [ main] INFO org.apache.flink.runtime.scheduler.SchedulerTestingUtils [] - Running initialization on master for job Unnamed job (7dee3fa6bd5706737478f5c7257fd2fb).
03:36:05,648 [ main] INFO org.apache.flink.runtime.scheduler.SchedulerTestingUtils [] - Successfully ran initialization on master in 0 ms.
03:36:05,649 [ main] INFO org.apache.flink.runtime.scheduler.adapter.DefaultExecutionTopology [] - Built 1 new pipelined regions in 0 ms, total 1 pipelined regions currently.
03:36:05,653 [ main] INFO org.apache.flink.runtime.scheduler.SchedulerTestingUtils [] - Using failover strategy org.apache.flink.runtime.executiongraph.failover.flip1.RestartPipelinedRegionFailoverStrategy@267e36f5 for Unnamed job (7dee3fa6bd5706737478f5c7257fd2fb).
03:36:05,653 [ main] INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - TestVertex (1/1) (e43f14294e9e15f2495185182b3cc6e1_22419611cc620103f97874a5f4f35f01_0_0) switched from RUNNING to CANCELING.
03:36:05,654 [ main] INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - TestVertex (1/1) (e43f14294e9e15f2495185182b3cc6e1_22419611cc620103f97874a5f4f35f01_0_0) switched from CANCELING to CANCELED.
03:36:05,654 [ main] INFO org.apache.flink.runtime.executiongraph.ExecutionGraph [] - Discarding the results produced by task execution e43f14294e9e15f2495185182b3cc6e1_22419611cc620103f97874a5f4f35f01_0_0.
03:36:05,654 [ main] INFO org.apache.flink.runtime.executiongraph.ExecutionVertexCancelTest [] -
--------------------------------------------------------------------------------
Test testRepeatedCancelFromRunning(org.apache.flink.runtime.executiongraph.ExecutionVertexCancelTest) successfully run.
================================================================================
{code}
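For context on why exit code 239 points at the fatal exit handler mentioned above: as far as I know, Flink's {{FatalExitExceptionHandler}} terminates the JVM with {{System.exit(-17)}}. The -17 value is my assumption here; it does not appear in the logs. On POSIX systems the parent process only sees the low 8 bits of the exit status, so -17 would be reported as 239. A minimal sketch of that arithmetic (the class and method names are hypothetical, only for illustration):

```java
public class ExitCodeSketch {
    // Assumed value passed to System.exit by the fatal handler.
    // This is an assumption for illustration, not taken from the logs.
    static final int ASSUMED_HANDLER_EXIT_CODE = -17;

    // POSIX reports only the low 8 bits of an exit status to the parent
    // process, so negative codes wrap into the range 0..255.
    static int reportedExitStatus(int exitCode) {
        return exitCode & 0xFF;
    }

    public static void main(String[] args) {
        // -17 is 0xFFFFFFEF in two's complement; the low byte 0xEF is 239.
        System.out.println(reportedExitStatus(ASSUMED_HANDLER_EXIT_CODE)); // prints 239
    }
}
```

If that assumption holds, "Process Exit Code: 239" in the surefire output is consistent with something in the forked JVM having called the handler, even though the test log itself shows no matching error.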
> ExecutionVertexCancelTest crashed with exit code 239
> ----------------------------------------------------
>
> Key: FLINK-28423
> URL: https://issues.apache.org/jira/browse/FLINK-28423
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination
> Affects Versions: 1.16.0
> Reporter: Martijn Visser
> Priority: Critical
> Labels: test-stability
>
> {code:java}
> Jul 06 03:40:19 [ERROR] Command was /bin/sh -c cd /__w/1/s/flink-runtime && /usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java -XX:+UseG1GC -Xms256m -Xmx768m -jar /__w/1/s/flink-runtime/target/surefire/surefirebooter4834330438302305324.jar /__w/1/s/flink-runtime/target/surefire 2022-07-06T03-33-45_475-jvmRun2 surefire1590202131814132315tmp surefire_28620311342464803109tmp
> Jul 06 03:40:19 [ERROR] Error occurred in starting fork, check output in log
> Jul 06 03:40:19 [ERROR] Process Exit Code: 239
> Jul 06 03:40:19 [ERROR] Crashed tests:
> Jul 06 03:40:19 [ERROR] org.apache.flink.runtime.executiongraph.ExecutionVertexCancelTest
> Jul 06 03:40:19 [ERROR] org.apache.maven.surefire.booter.SurefireBooterForkException: ExecutionException The forked VM terminated without properly saying goodbye. VM crash or System.exit called?
> Jul 06 03:40:19 [ERROR] Command was /bin/sh -c cd /__w/1/s/flink-runtime && /usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java -XX:+UseG1GC -Xms256m -Xmx768m -jar /__w/1/s/flink-runtime/target/surefire/surefirebooter4834330438302305324.jar /__w/1/s/flink-runtime/target/surefire 2022-07-06T03-33-45_475-jvmRun2 surefire1590202131814132315tmp surefire_28620311342464803109tmp
> Jul 06 03:40:19 [ERROR] Error occurred in starting fork, check output in log
> Jul 06 03:40:19 [ERROR] Process Exit Code: 239
> Jul 06 03:40:19 [ERROR] Crashed tests:
> Jul 06 03:40:19 [ERROR] org.apache.flink.runtime.executiongraph.ExecutionVertexCancelTest
> Jul 06 03:40:19 [ERROR] at org.apache.maven.plugin.surefire.booterclient.ForkStarter.awaitResultsDone(ForkStarter.java:532)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.plugin.surefire.booterclient.ForkStarter.runSuitesForkOnceMultiple(ForkStarter.java:405)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.plugin.surefire.booterclient.ForkStarter.run(ForkStarter.java:321)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.plugin.surefire.booterclient.ForkStarter.run(ForkStarter.java:266)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeProvider(AbstractSurefireMojo.java:1314)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeAfterPreconditionsChecked(AbstractSurefireMojo.java:1159)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.plugin.surefire.AbstractSurefireMojo.execute(AbstractSurefireMojo.java:932)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo(DefaultBuildPluginManager.java:132)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:208)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:153)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:145)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:116)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:80)
> Jul 06 03:40:19 [ERROR] at org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build(SingleThreadedBuilder.java:51)
> {code}
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=37685&view=logs&j=4d4a0d10-fca2-5507-8eed-c07f0bdf4887&t=7b25afdf-cc6c-566f-5459-359dc2585798&l=8128
--
This message was sent by Atlassian Jira
(v8.20.10#820010)