You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Chesnay Schepler (JIRA)" <ji...@apache.org> on 2019/07/24 11:09:00 UTC

[jira] [Closed] (FLINK-13169) IT test for fine-grained recovery (task executor failures)

     [ https://issues.apache.org/jira/browse/FLINK-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chesnay Schepler closed FLINK-13169.
------------------------------------
    Resolution: Fixed

master: 6902915509323e28611af050f59911964daa3d8f

1.9: ada7613e7e1ecdce2c925ff5051ec7047d2d8e5d

> IT test for fine-grained recovery (task executor failures)
> ----------------------------------------------------------
>
>                 Key: FLINK-13169
>                 URL: https://issues.apache.org/jira/browse/FLINK-13169
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Runtime / Coordination
>            Reporter: Andrey Zagrebin
>            Assignee: Andrey Zagrebin
>            Priority: Critical
>              Labels: pull-request-available
>             Fix For: 1.9.0
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> The BatchFineGrainedRecoveryITCase can be extended with an additional test failure strategy which abruptly shuts down the task executor. This leads to the loss of all previously completed and the in-progress mapper result partitions. The fail-over strategy should restart the current in-progress mapper which will get the PartitionNotFoundException because the previous result becomes unavailable and the previous mapper has to be restarted as well. The same should happen subsequently with all previous mappers. When the source is recomputed, all mappers has to be restarted again to recalculate their lost results.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)