You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/06/14 12:04:00 UTC

[jira] [Updated] (FLINK-28052) Remove AdaptiveSchedulerTest#RunFailedJobListener

     [ https://issues.apache.org/jira/browse/FLINK-28052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated FLINK-28052:
-----------------------------------
    Labels: pull-request-available  (was: )

> Remove AdaptiveSchedulerTest#RunFailedJobListener
> -------------------------------------------------
>
>                 Key: FLINK-28052
>                 URL: https://issues.apache.org/jira/browse/FLINK-28052
>             Project: Flink
>          Issue Type: Technical Debt
>          Components: Runtime / Coordination, Tests
>            Reporter: Chesnay Schepler
>            Assignee: Chesnay Schepler
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.16.0
>
>
> The RunFailedJobListener has rather obscure semantics.
> It considers a job to be terminal after it was restarted. This behavior is quite specific to a particular test case.
> A cleaner approach is just to just cancel the job and wait for it to terminate.
> Additionally it considered a job as running purely based on the job status, whereas, in particular when checkpointing is involved, waiting for the tasks to be submitted is a better measure.
> In fact, testExceptionHistoryWithTaskFailureFromStopWithSavepoint is broken since a savepoint is never triggered, as not all tasks are running.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)