You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Matthias Pohl (Jira)" <ji...@apache.org> on 2024/01/24 13:00:00 UTC
[jira] [Comment Edited] (FLINK-34227) Github Actions: table stage timed out in AdaptiveScheduler profile
[ https://issues.apache.org/jira/browse/FLINK-34227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17810373#comment-17810373 ]
Matthias Pohl edited comment on FLINK-34227 at 1/24/24 12:59 PM:
-----------------------------------------------------------------
Ok, {{WindowDistinctAggregateITCase}} seems to throttle the overall stage execution with 21 succeeding but 35 failing:
{code}
➜ logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep "Test test.*WindowDistinctAggregateITCase" mvn-2.log | grep -v "is running" | grep -c success
21
➜ logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep "Test test.*WindowDistinctAggregateITCase" mvn-2.log | grep -v "is running" | grep -c fail
35
{code}
There are no other test failures in that stage:
{code}
➜ logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep -c "failed with" mvn-2.log
35
{code}
All of them seem to have been caused by {{ResourceNotAvailableException}}s:
{code}
➜ logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep -c "Failed to go from CreatingExecutionGraph to Executing because the ExecutionGraph creation failed." mvn-2.log
35
➜ logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep -c "NoResourceAvailableException" mvn-2.log
140 # 140 / 35 = 4
{code}
was (Author: mapohl):
Ok, {{WindowDistinctAggregateITCase}} seems to throttle the overall stage execution with 21 succeeding but 35 failing:
{code}
➜ logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep "Test test.*WindowDistinctAggregateITCase" mvn-2.log | grep -v "is running" | grep -c success
21
➜ logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep "Test test.*WindowDistinctAggregateITCase" mvn-2.log | grep -v "is running" | grep -c fail
35
{code}
There are no other test failures in that stage:
{code}
➜ logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep -c "failed with" mvn-2.log
35
{code}
> Github Actions: table stage timed out in AdaptiveScheduler profile
> ------------------------------------------------------------------
>
> Key: FLINK-34227
> URL: https://issues.apache.org/jira/browse/FLINK-34227
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination
> Affects Versions: 1.18.1
> Reporter: Matthias Pohl
> Priority: Major
> Labels: github-actions, test-stability
>
> https://github.com/XComp/flink/actions/runs/7634987973/job/20800205972#step:10:14557
> {code}
> [...]
> "main" #1 prio=5 os_prio=0 tid=0x00007fcccc4b7000 nid=0x24ec0 waiting on condition [0x00007fccce1eb000]
> java.lang.Thread.State: WAITING (parking)
> at sun.misc.Unsafe.park(Native Method)
> - parking to wait for <0x00000000bdd52618> (a java.util.concurrent.CompletableFuture$Signaller)
> at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
> at java.util.concurrent.CompletableFuture$Signaller.block(CompletableFuture.java:1707)
> at java.util.concurrent.ForkJoinPool.managedBlock(ForkJoinPool.java:3323)
> at java.util.concurrent.CompletableFuture.waitingGet(CompletableFuture.java:1742)
> at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1908)
> at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:2131)
> at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:2099)
> at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:2077)
> at org.apache.flink.streaming.api.scala.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.scala:876)
> at org.apache.flink.table.planner.runtime.stream.sql.WindowDistinctAggregateITCase.testHopWindow_Cube(WindowDistinctAggregateITCase.scala:550)
> [...]
> {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)