You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/06/14 12:04:00 UTC
[jira] [Updated] (FLINK-28052) Remove AdaptiveSchedulerTest#RunFailedJobListener
[ https://issues.apache.org/jira/browse/FLINK-28052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated FLINK-28052:
-----------------------------------
Labels: pull-request-available (was: )
> Remove AdaptiveSchedulerTest#RunFailedJobListener
> -------------------------------------------------
>
> Key: FLINK-28052
> URL: https://issues.apache.org/jira/browse/FLINK-28052
> Project: Flink
> Issue Type: Technical Debt
> Components: Runtime / Coordination, Tests
> Reporter: Chesnay Schepler
> Assignee: Chesnay Schepler
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.16.0
>
>
> The RunFailedJobListener has rather obscure semantics.
> It considers a job to be terminal after it was restarted. This behavior is quite specific to a particular test case.
> A cleaner approach is just to just cancel the job and wait for it to terminate.
> Additionally it considered a job as running purely based on the job status, whereas, in particular when checkpointing is involved, waiting for the tasks to be submitted is a better measure.
> In fact, testExceptionHistoryWithTaskFailureFromStopWithSavepoint is broken since a savepoint is never triggered, as not all tasks are running.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)