Posted to issues@ignite.apache.org by "Alexey Goncharuk (Jira)" <ji...@apache.org> on 2019/09/09 15:44:00 UTC

[jira] [Updated] (IGNITE-12128) Potentially pds corruption on a failed node during checkpoint

     [ https://issues.apache.org/jira/browse/IGNITE-12128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alexey Goncharuk updated IGNITE-12128:
--------------------------------------
    Release Note: Fixed an issue causing potential PDS corruption when a node is killed during the checkpoint mark phase

> Potentially pds corruption on a failed node during checkpoint
> -------------------------------------------------------------
>
>                 Key: IGNITE-12128
>                 URL: https://issues.apache.org/jira/browse/IGNITE-12128
>             Project: Ignite
>          Issue Type: Bug
>            Reporter: Dmitriy Govorukhin
>            Assignee: Anton Kalashnikov
>            Priority: Critical
>             Fix For: 2.7.6
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> There is a case where a checkpoint has started but the CP file marker has not yet been created, while PageMemory may already have started flushing dirty pages from the checkpoint pages to the page store. If the node crashes at this moment, we can end up in an inconsistent state, because the checkpoint marker has not been written to disk yet but some pages for this checkpoint already have been. If we try to recover from this state, we can get any sort of corruption. The recovery logic may not recognize that the crash happened during a checkpoint, because no file marker was written when the checkpoint started even though some of its pages were written.
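
The ordering problem described above can be illustrated with a minimal, self-contained sketch. It is not Ignite code and not necessarily how the fix for this issue works: the class, the method names, and the `-START.bin` / `-END.bin` file naming are all hypothetical. It only shows the general idea that a start marker is forced to disk before any page reaches the page store, so that recovery can tell a checkpoint was in progress.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

/** Hypothetical sketch; none of these names are Ignite APIs. */
public class CheckpointMarkerSketch {
    /** Writes the whole array, looping because a single write may be partial. */
    static void writeFully(FileChannel ch, byte[] data) throws IOException {
        ByteBuffer buf = ByteBuffer.wrap(data);
        while (buf.hasRemaining())
            ch.write(buf);
    }

    /** Creates a marker file and forces it to disk before returning. */
    static void writeMarker(Path cpDir, String cpId, String kind) throws IOException {
        Path marker = cpDir.resolve(cpId + "-" + kind + ".bin");
        try (FileChannel ch = FileChannel.open(marker,
                StandardOpenOption.CREATE_NEW, StandardOpenOption.WRITE)) {
            writeFully(ch, cpId.getBytes(StandardCharsets.UTF_8));
            ch.force(true); // a real implementation may also need to sync the directory
        }
    }

    /** Recovery check: a START marker without an END marker means an interrupted checkpoint. */
    static boolean crashedDuringCheckpoint(Path cpDir, String cpId) {
        return Files.exists(cpDir.resolve(cpId + "-START.bin"))
            && !Files.exists(cpDir.resolve(cpId + "-END.bin"));
    }

    /**
     * Runs one checkpoint. crashAfterPages simulates a node failure after that
     * many pages have been written; pass -1 for no failure.
     */
    static void checkpoint(Path cpDir, Path pageStore, String cpId,
                           byte[][] dirtyPages, int crashAfterPages) throws IOException {
        // 1. The start marker must be durable before any page is flushed.
        writeMarker(cpDir, cpId, "START");

        // 2. Only now flush dirty pages to the page store.
        try (FileChannel store = FileChannel.open(pageStore,
                StandardOpenOption.CREATE, StandardOpenOption.WRITE, StandardOpenOption.APPEND)) {
            for (int i = 0; i < dirtyPages.length; i++) {
                if (i == crashAfterPages)
                    throw new IllegalStateException("simulated crash after " + i + " page(s)");
                writeFully(store, dirtyPages[i]);
            }
            store.force(true);
        }

        // 3. The end marker records that every page of this checkpoint is on disk.
        writeMarker(cpDir, cpId, "END");
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("cp-sketch");
        Path store = dir.resolve("pages.bin");
        byte[][] pages = { new byte[16], new byte[16] };

        try {
            checkpoint(dir, store, "cp-1", pages, 1); // fails after the first page
        } catch (IllegalStateException e) {
            // Simulated node failure: one page is on disk, no END marker exists.
        }
        System.out.println("cp-1 interrupted: " + crashedDuringCheckpoint(dir, "cp-1"));

        checkpoint(dir, store, "cp-2", pages, -1); // completes normally
        System.out.println("cp-2 interrupted: " + crashedDuringCheckpoint(dir, "cp-2"));
    }
}
```

In the scenario from the description, step 2 could begin before step 1 had completed, so after a crash neither marker existed and `crashedDuringCheckpoint` would have returned false even though the page store had already been partially overwritten.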



--
This message was sent by Atlassian Jira
(v8.3.2#803003)