Posted to issues@flink.apache.org by "Chesnay Schepler (Jira)" <ji...@apache.org> on 2022/06/14 12:02:00 UTC

[jira] [Created] (FLINK-28052) Remove AdaptiveSchedulerTest#RunFailedJobListener

Chesnay Schepler created FLINK-28052:
----------------------------------------

             Summary: Remove AdaptiveSchedulerTest#RunFailedJobListener
                 Key: FLINK-28052
                 URL: https://issues.apache.org/jira/browse/FLINK-28052
             Project: Flink
          Issue Type: Technical Debt
          Components: Runtime / Coordination, Tests
            Reporter: Chesnay Schepler
            Assignee: Chesnay Schepler
             Fix For: 1.16.0


The RunFailedJobListener has rather obscure semantics.
It considers a job to be terminal once it has been restarted. This behavior is specific to one particular test case.
A cleaner approach is to simply cancel the job and wait for it to terminate.

Additionally, it considers a job to be running purely based on the job status. Waiting for the tasks to be submitted is a better measure, in particular when checkpointing is involved.
In fact, testExceptionHistoryWithTaskFailureFromStopWithSavepoint is broken: a savepoint is never triggered because not all tasks are running.
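
As a rough illustration of "waiting for the tasks to be submitted", a test could count submissions and only proceed (e.g. trigger a savepoint) once all of them have arrived. The sketch below is self-contained and uses only the JDK; the class SubmissionLatch and its methods are hypothetical names made up for this example and are not part of Flink or of the actual fix.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

/**
 * Hypothetical test helper (not a Flink class): lets a test block until the
 * expected number of tasks has been submitted, instead of relying on the
 * job status alone.
 */
class SubmissionLatch {
    private final CountDownLatch pending;

    SubmissionLatch(int expectedTasks) {
        this.pending = new CountDownLatch(expectedTasks);
    }

    /** To be called by the test's submission hook once per submitted task. */
    void onTaskSubmitted() {
        pending.countDown();
    }

    /** Returns true once every expected task was submitted, false on timeout or interrupt. */
    boolean awaitAllSubmitted(long timeout, TimeUnit unit) {
        try {
            return pending.await(timeout, unit);
        } catch (InterruptedException e) {
            // Restore the interrupt flag and report that we did not see all submissions.
            Thread.currentThread().interrupt();
            return false;
        }
    }

    /** Number of submissions still outstanding. */
    long remaining() {
        return pending.getCount();
    }
}
```

A test would create the latch with the job's total task count, call onTaskSubmitted() from whatever submission callback it controls, and only trigger the savepoint after awaitAllSubmitted(...) returns true.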



--
This message was sent by Atlassian Jira
(v8.20.7#820007)