You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Matthias Pohl (Jira)" <ji...@apache.org> on 2024/01/24 13:00:00 UTC

[jira] [Comment Edited] (FLINK-34227) Github Actions: table stage timed out in AdaptiveScheduler profile

    [ https://issues.apache.org/jira/browse/FLINK-34227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17810373#comment-17810373 ] 

Matthias Pohl edited comment on FLINK-34227 at 1/24/24 12:59 PM:
-----------------------------------------------------------------

Ok, {{WindowDistinctAggregateITCase}} seems to throttle the overall stage execution with 21 succeeding but 35 failing:
{code}
➜  logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep "Test test.*WindowDistinctAggregateITCase" mvn-2.log | grep -v "is running" | grep -c success
21
➜  logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep "Test test.*WindowDistinctAggregateITCase" mvn-2.log | grep -v "is running" | grep -c fail   
35
{code}
There are no other test failures in that stage:
{code}
➜  logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep -c "failed with" mvn-2.log
35
{code}
All of them seem to have been caused by {{ResourceNotAvailableException}}s:
{code}
➜  logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep -c "Failed to go from CreatingExecutionGraph to Executing because the ExecutionGraph creation failed." mvn-2.log 
35
➜  logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep -c "NoResourceAvailableException" mvn-2.log
140 # 140 / 35 = 4
{code}


was (Author: mapohl):
Ok, {{WindowDistinctAggregateITCase}} seems to throttle the overall stage execution with 21 succeeding but 35 failing:
{code}
➜  logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep "Test test.*WindowDistinctAggregateITCase" mvn-2.log | grep -v "is running" | grep -c success
21
➜  logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep "Test test.*WindowDistinctAggregateITCase" mvn-2.log | grep -v "is running" | grep -c fail   
35
{code}
There are no other test failures in that stage:
{code}
➜  logs-test-nightly-adaptive-scheduler-64-table-test-1706074714 grep -c "failed with" mvn-2.log
35
{code}

> Github Actions: table stage timed out in AdaptiveScheduler profile
> ------------------------------------------------------------------
>
>                 Key: FLINK-34227
>                 URL: https://issues.apache.org/jira/browse/FLINK-34227
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Coordination
>    Affects Versions: 1.18.1
>            Reporter: Matthias Pohl
>            Priority: Major
>              Labels: github-actions, test-stability
>
> https://github.com/XComp/flink/actions/runs/7634987973/job/20800205972#step:10:14557
> {code}
> [...]
> "main" #1 prio=5 os_prio=0 tid=0x00007fcccc4b7000 nid=0x24ec0 waiting on condition [0x00007fccce1eb000]
>    java.lang.Thread.State: WAITING (parking)
> 	at sun.misc.Unsafe.park(Native Method)
> 	- parking to wait for  <0x00000000bdd52618> (a java.util.concurrent.CompletableFuture$Signaller)
> 	at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
> 	at java.util.concurrent.CompletableFuture$Signaller.block(CompletableFuture.java:1707)
> 	at java.util.concurrent.ForkJoinPool.managedBlock(ForkJoinPool.java:3323)
> 	at java.util.concurrent.CompletableFuture.waitingGet(CompletableFuture.java:1742)
> 	at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1908)
> 	at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:2131)
> 	at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:2099)
> 	at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:2077)
> 	at org.apache.flink.streaming.api.scala.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.scala:876)
> 	at org.apache.flink.table.planner.runtime.stream.sql.WindowDistinctAggregateITCase.testHopWindow_Cube(WindowDistinctAggregateITCase.scala:550)
> [...]
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)