Posted to dev@ignite.apache.org by "Andrey Aleksandrov (JIRA)" <ji...@apache.org> on 2018/10/23 11:29:00 UTC

[jira] [Created] (IGNITE-9975) Possible data loss during starting of the nodes with empty pds

Andrey Aleksandrov created IGNITE-9975:
------------------------------------------

             Summary: Possible data loss during starting of the nodes with empty pds
                 Key: IGNITE-9975
                 URL: https://issues.apache.org/jira/browse/IGNITE-9975
             Project: Ignite
          Issue Type: Bug
          Components: persistence
    Affects Versions: 2.4
            Reporter: Andrey Aleksandrov
            Assignee: Alexey Stelmak
             Fix For: 2.7
         Attachments: reproducer.java

Case: 

1) Start 3 data nodes (DN1, DN2, DN3) and activate the cluster with a cache configured with 1 backup and PartitionLossPolicy.READ_ONLY_SAFE.
2) Start a client and add data to the cache. Stop the client.
3) Stop DN2 and clear its pds and wal.
4) Start DN2. Rebalance will start.
5) During the rebalance, stop DN3.
6) Start DN3.

At this point some partitions on DN2 are marked as LOST and the cache size is less than expected.

7) Run resetLostPartitions(caches).

Now all partitions on DN2 are marked as OWNING, but the cache size is still less than expected.
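
The distinction between the two observations above (partitions visibly LOST versus all partitions OWNING with entries still missing) can be sketched as a small check. This is only an illustration: LossCheckSketch, Verdict and classify are hypothetical names, not part of Ignite, and the sizes are made up. In a real test the inputs would presumably come from calls such as cache.lostPartitions() and cache.size().

```java
import java.util.Arrays;
import java.util.Collection;
import java.util.Collections;

// Hypothetical helper, not part of Ignite: classifies the cache state
// from the expected size, the observed size and the lost partition ids.
class LossCheckSketch {
    enum Verdict { OK, LOST_PARTITIONS, SILENT_DATA_LOSS }

    static Verdict classify(long expectedSize, long actualSize, Collection<Integer> lostPartitions) {
        // Partitions are still reported as LOST: the loss is visible to the user.
        if (!lostPartitions.isEmpty())
            return Verdict.LOST_PARTITIONS;

        // Nothing is reported as LOST, yet entries are missing:
        // this is the state described after step 7.
        return actualSize < expectedSize ? Verdict.SILENT_DATA_LOSS : Verdict.OK;
    }

    public static void main(String[] args) {
        // Made-up numbers, for illustration only.
        System.out.println(classify(1000, 900, Arrays.asList(3, 7)));
        System.out.println(classify(1000, 900, Collections.<Integer>emptyList()));
        System.out.println(classify(1000, 1000, Collections.<Integer>emptyList()));
    }
}
```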

Workaround:

After step 6, do the following:

7) Force a rebalance using the deactivate/activate methods.
8) Wait for the rebalance to complete.

Now the cache size is as expected, but some partitions on DN2 are marked as LOST.

9) Run resetLostPartitions(caches).

Now the cache size is OK and all partitions on DN2 are marked as OWNING.
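
The order of the workaround steps 7-9 can be sketched as follows. This is a minimal sketch, not the attached reproducer: ClusterOps and RecordingOps are hypothetical types introduced here so the example runs without a cluster. Against a real node the three operations would presumably map to ignite.cluster().active(false) followed by ignite.cluster().active(true), some way of waiting until rebalance finishes, and ignite.resetLostPartitions(caches); the cache name "myCache" is also an assumption.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

class WorkaroundSketch {
    /** Hypothetical abstraction over the cluster calls; not an Ignite interface. */
    interface ClusterOps {
        void setActive(boolean active);                 // deactivate / activate the cluster
        void awaitRebalance();                          // block until rebalance has completed
        void resetLostPartitions(List<String> caches);  // clear the LOST state
    }

    /** Steps 7-9: force a rebalance first, reset lost partitions only afterwards. */
    static void applyWorkaround(ClusterOps ops, List<String> caches) {
        ops.setActive(false);            // step 7: deactivate ...
        ops.setActive(true);             // ... and activate to force a rebalance
        ops.awaitRebalance();            // step 8
        ops.resetLostPartitions(caches); // step 9
    }

    /** Stub that only records the calls it receives. */
    static class RecordingOps implements ClusterOps {
        final List<String> calls = new ArrayList<>();

        @Override public void setActive(boolean active) {
            calls.add("setActive(" + active + ")");
        }

        @Override public void awaitRebalance() {
            calls.add("awaitRebalance");
        }

        @Override public void resetLostPartitions(List<String> caches) {
            calls.add("resetLostPartitions(" + caches + ")");
        }
    }

    public static void main(String[] args) {
        RecordingOps ops = new RecordingOps();
        applyWorkaround(ops, Collections.singletonList("myCache"));
        System.out.println(ops.calls);
    }
}
```

The point of the ordering is that resetLostPartitions is called only after the forced rebalance has completed; calling it directly after step 6 is the path that leaves the cache smaller than expected.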

However, it looks like without the forced rebalance we have data loss here.




 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)