You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ignite.apache.org by "Ivan Rakov (JIRA)" <ji...@apache.org> on 2019/02/20 11:41:00 UTC

[jira] [Updated] (IGNITE-11367) Fix several issues in PageMemoryTracker

     [ https://issues.apache.org/jira/browse/IGNITE-11367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ivan Rakov updated IGNITE-11367:
--------------------------------
    Description: 
I've discovered some issues in PageMemoryTracker while debugging IGNITE-10873:
1) Mock page memory doesn't implement PageMemoryImpl#pageBuffer. As a result, some delta records (which applies changes via buffer) can't be applied. Example:
{code:java}
		Caused by: java.lang.NullPointerException
			at org.apache.ignite.internal.processors.cache.persistence.tree.io.TrackingPageIO.getLastSnapshotTag0(TrackingPageIO.java:235)
			at org.apache.ignite.internal.processors.cache.persistence.tree.io.TrackingPageIO.getLastSnapshotTag(TrackingPageIO.java:227)
			at org.apache.ignite.internal.processors.cache.persistence.tree.io.TrackingPageIO.validateSnapshotTag(TrackingPageIO.java:135)
			at org.apache.ignite.internal.processors.cache.persistence.tree.io.TrackingPageIO.markChanged(TrackingPageIO.java:93)
			at org.apache.ignite.internal.pagemem.wal.record.delta.TrackingPageDeltaRecord.applyDelta(TrackingPageDeltaRecord.java:75)
			at org.apache.ignite.internal.processors.cache.persistence.wal.memtracker.PageMemoryTracker.applyWalRecord(PageMemoryTracker.java:447)
			at org.apache.ignite.internal.processors.cache.persistence.wal.memtracker.PageMemoryTracker.access$000(PageMemoryTracker.java:81)
			at org.apache.ignite.internal.processors.cache.persistence.wal.memtracker.PageMemoryTracker$1.log(PageMemoryTracker.java:159)
			at org.gridgain.grid.internal.processors.cache.database.snapshot.GridCacheSnapshotManager.onChangeTrackerPage(GridCacheSnapshotManager.java:2801)
			at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager$5.applyx(GridCacheDatabaseSharedManager.java:1084)
			at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager$5.applyx(GridCacheDatabaseSharedManager.java:1077)
			at org.apache.ignite.internal.util.lang.GridInClosure3X.apply(GridInClosure3X.java:34)
			at org.apache.ignite.internal.processors.cache.persistence.pagemem.PageMemoryImpl.writeUnlockPage(PageMemoryImpl.java:1572)
			at org.apache.ignite.internal.processors.cache.persistence.pagemem.PageMemoryImpl.writeUnlock(PageMemoryImpl.java:495)
			at org.apache.ignite.internal.processors.cache.persistence.pagemem.PageMemoryImpl.writeUnlock(PageMemoryImpl.java:487)
			at org.apache.ignite.internal.processors.cache.persistence.tree.util.PageHandler.writeUnlock(PageHandler.java:394)
			at org.apache.ignite.internal.processors.cache.persistence.tree.util.PageHandler.writePage(PageHandler.java:369)
			at org.apache.ignite.internal.processors.cache.persistence.DataStructure.write(DataStructure.java:285)
			at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.access$11500(BPlusTree.java:92)
			at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree$Put.tryReplace(BPlusTree.java:3638)
			at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.putDown(BPlusTree.java:2565)
			at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.doPut(BPlusTree.java:2293)
			... 33 more
{code}
2) During binary recovery phase, page memory is changed by applying delta records and page snapshots (GridCacheDatabaseSharedManager#applyPageDelta, #applyPageSnapshot). Such changes are not replicated by logging delta records to WAL (we don't log physical records on binary recovery - we just apply already logged ones). This leads to false positive broken consistency reports. To prevent this, we should apply changes to both regular page memory and mock page memory in PageMemoryTracker.
3) PagesList.java:918:
{code:java}
                        // Here we should never write full page, because it is known to be new.
                        if (needWalDeltaRecord(nextId, nextPage, FALSE))
                            wal.log(new PagesListInitNewPageRecord(
                                grpId,
                                nextId,
                                io.getType(),
                                io.getVersion(),
                                nextId,
                                prevId,
                                0L
                            ));
{code}
Sometimes we may log InitNewPageRecord without logging page snapshot. In case page was recycled before, its content in unused page part may trigger false positive reports. We may fix this by filling page content with zeros during PagesList#setupNextPage call.

Modified test from attached patch should pass if all mentioned issues will be fixed. Please ensure that both Mockito and ignite-indexing are in classpath when you run the test.

  was:
I've discovered some issues in PageMemoryTracker while debugging IGNITE-10873:
1) Mock page memory doesn't implement PageMemoryImpl#pageBuffer. As a result, some delta records (which applies changes via buffer) can't be applied. Example:
{code:java}
		Caused by: java.lang.NullPointerException
			at org.apache.ignite.internal.processors.cache.persistence.tree.io.TrackingPageIO.getLastSnapshotTag0(TrackingPageIO.java:235)
			at org.apache.ignite.internal.processors.cache.persistence.tree.io.TrackingPageIO.getLastSnapshotTag(TrackingPageIO.java:227)
			at org.apache.ignite.internal.processors.cache.persistence.tree.io.TrackingPageIO.validateSnapshotTag(TrackingPageIO.java:135)
			at org.apache.ignite.internal.processors.cache.persistence.tree.io.TrackingPageIO.markChanged(TrackingPageIO.java:93)
			at org.apache.ignite.internal.pagemem.wal.record.delta.TrackingPageDeltaRecord.applyDelta(TrackingPageDeltaRecord.java:75)
			at org.apache.ignite.internal.processors.cache.persistence.wal.memtracker.PageMemoryTracker.applyWalRecord(PageMemoryTracker.java:447)
			at org.apache.ignite.internal.processors.cache.persistence.wal.memtracker.PageMemoryTracker.access$000(PageMemoryTracker.java:81)
			at org.apache.ignite.internal.processors.cache.persistence.wal.memtracker.PageMemoryTracker$1.log(PageMemoryTracker.java:159)
			at org.gridgain.grid.internal.processors.cache.database.snapshot.GridCacheSnapshotManager.onChangeTrackerPage(GridCacheSnapshotManager.java:2801)
			at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager$5.applyx(GridCacheDatabaseSharedManager.java:1084)
			at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager$5.applyx(GridCacheDatabaseSharedManager.java:1077)
			at org.apache.ignite.internal.util.lang.GridInClosure3X.apply(GridInClosure3X.java:34)
			at org.apache.ignite.internal.processors.cache.persistence.pagemem.PageMemoryImpl.writeUnlockPage(PageMemoryImpl.java:1572)
			at org.apache.ignite.internal.processors.cache.persistence.pagemem.PageMemoryImpl.writeUnlock(PageMemoryImpl.java:495)
			at org.apache.ignite.internal.processors.cache.persistence.pagemem.PageMemoryImpl.writeUnlock(PageMemoryImpl.java:487)
			at org.apache.ignite.internal.processors.cache.persistence.tree.util.PageHandler.writeUnlock(PageHandler.java:394)
			at org.apache.ignite.internal.processors.cache.persistence.tree.util.PageHandler.writePage(PageHandler.java:369)
			at org.apache.ignite.internal.processors.cache.persistence.DataStructure.write(DataStructure.java:285)
			at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.access$11500(BPlusTree.java:92)
			at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree$Put.tryReplace(BPlusTree.java:3638)
			at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.putDown(BPlusTree.java:2565)
			at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.doPut(BPlusTree.java:2293)
			... 33 more
{code}
2) During binary recovery phase, page memory is changed with applying delta records and page snapshots (GridCacheDatabaseSharedManager#applyPageDelta, #applyPageSnapshot). Such changes are not replicated by logging delta records to WAL (we don't log physical records on binary recovery - we just apply already logged ones). This leads to false positive broken consistency reports. To prevent this, we should apply changes to both regular page memory and mock page memory in PageMemoryTracker.
3) PagesList.java:918:
{code:java}
                        // Here we should never write full page, because it is known to be new.
                        if (needWalDeltaRecord(nextId, nextPage, FALSE))
                            wal.log(new PagesListInitNewPageRecord(
                                grpId,
                                nextId,
                                io.getType(),
                                io.getVersion(),
                                nextId,
                                prevId,
                                0L
                            ));
{code}
Sometimes we may log InitNewPageRecord without logging page snapshot. In case page was recycled before, its content in unused page part may trigger false positive reports. We may fix this by filling page content with zeros during PagesList#setupNextPage call.

Modified test from attached patch should pass if all mentioned issues will be fixed. Please ensure that both Mockito and ignite-indexing are in classpath when you run the test.


> Fix several issues in PageMemoryTracker
> ---------------------------------------
>
>                 Key: IGNITE-11367
>                 URL: https://issues.apache.org/jira/browse/IGNITE-11367
>             Project: Ignite
>          Issue Type: Bug
>            Reporter: Ivan Rakov
>            Priority: Major
>             Fix For: 2.8
>
>
> I've discovered some issues in PageMemoryTracker while debugging IGNITE-10873:
> 1) Mock page memory doesn't implement PageMemoryImpl#pageBuffer. As a result, some delta records (which applies changes via buffer) can't be applied. Example:
> {code:java}
> 		Caused by: java.lang.NullPointerException
> 			at org.apache.ignite.internal.processors.cache.persistence.tree.io.TrackingPageIO.getLastSnapshotTag0(TrackingPageIO.java:235)
> 			at org.apache.ignite.internal.processors.cache.persistence.tree.io.TrackingPageIO.getLastSnapshotTag(TrackingPageIO.java:227)
> 			at org.apache.ignite.internal.processors.cache.persistence.tree.io.TrackingPageIO.validateSnapshotTag(TrackingPageIO.java:135)
> 			at org.apache.ignite.internal.processors.cache.persistence.tree.io.TrackingPageIO.markChanged(TrackingPageIO.java:93)
> 			at org.apache.ignite.internal.pagemem.wal.record.delta.TrackingPageDeltaRecord.applyDelta(TrackingPageDeltaRecord.java:75)
> 			at org.apache.ignite.internal.processors.cache.persistence.wal.memtracker.PageMemoryTracker.applyWalRecord(PageMemoryTracker.java:447)
> 			at org.apache.ignite.internal.processors.cache.persistence.wal.memtracker.PageMemoryTracker.access$000(PageMemoryTracker.java:81)
> 			at org.apache.ignite.internal.processors.cache.persistence.wal.memtracker.PageMemoryTracker$1.log(PageMemoryTracker.java:159)
> 			at org.gridgain.grid.internal.processors.cache.database.snapshot.GridCacheSnapshotManager.onChangeTrackerPage(GridCacheSnapshotManager.java:2801)
> 			at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager$5.applyx(GridCacheDatabaseSharedManager.java:1084)
> 			at org.apache.ignite.internal.processors.cache.persistence.GridCacheDatabaseSharedManager$5.applyx(GridCacheDatabaseSharedManager.java:1077)
> 			at org.apache.ignite.internal.util.lang.GridInClosure3X.apply(GridInClosure3X.java:34)
> 			at org.apache.ignite.internal.processors.cache.persistence.pagemem.PageMemoryImpl.writeUnlockPage(PageMemoryImpl.java:1572)
> 			at org.apache.ignite.internal.processors.cache.persistence.pagemem.PageMemoryImpl.writeUnlock(PageMemoryImpl.java:495)
> 			at org.apache.ignite.internal.processors.cache.persistence.pagemem.PageMemoryImpl.writeUnlock(PageMemoryImpl.java:487)
> 			at org.apache.ignite.internal.processors.cache.persistence.tree.util.PageHandler.writeUnlock(PageHandler.java:394)
> 			at org.apache.ignite.internal.processors.cache.persistence.tree.util.PageHandler.writePage(PageHandler.java:369)
> 			at org.apache.ignite.internal.processors.cache.persistence.DataStructure.write(DataStructure.java:285)
> 			at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.access$11500(BPlusTree.java:92)
> 			at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree$Put.tryReplace(BPlusTree.java:3638)
> 			at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.putDown(BPlusTree.java:2565)
> 			at org.apache.ignite.internal.processors.cache.persistence.tree.BPlusTree.doPut(BPlusTree.java:2293)
> 			... 33 more
> {code}
> 2) During binary recovery phase, page memory is changed by applying delta records and page snapshots (GridCacheDatabaseSharedManager#applyPageDelta, #applyPageSnapshot). Such changes are not replicated by logging delta records to WAL (we don't log physical records on binary recovery - we just apply already logged ones). This leads to false positive broken consistency reports. To prevent this, we should apply changes to both regular page memory and mock page memory in PageMemoryTracker.
> 3) PagesList.java:918:
> {code:java}
>                         // Here we should never write full page, because it is known to be new.
>                         if (needWalDeltaRecord(nextId, nextPage, FALSE))
>                             wal.log(new PagesListInitNewPageRecord(
>                                 grpId,
>                                 nextId,
>                                 io.getType(),
>                                 io.getVersion(),
>                                 nextId,
>                                 prevId,
>                                 0L
>                             ));
> {code}
> Sometimes we may log InitNewPageRecord without logging page snapshot. In case page was recycled before, its content in unused page part may trigger false positive reports. We may fix this by filling page content with zeros during PagesList#setupNextPage call.
> Modified test from attached patch should pass if all mentioned issues will be fixed. Please ensure that both Mockito and ignite-indexing are in classpath when you run the test.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)