You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oak-issues@jackrabbit.apache.org by "Alex Parvulescu (JIRA)" <ji...@apache.org> on 2014/06/16 15:44:01 UTC

[jira] [Updated] (OAK-1877) Hourly async reindexing on an idle instance

     [ https://issues.apache.org/jira/browse/OAK-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alex Parvulescu updated OAK-1877:
---------------------------------

    Attachment: updates-without-indexed-changes.patch

there's a regression from the latest changes: content changes that are not indexed would generate a new checkpoint, but fail to properly link it from the 'async' reference, resulting in a large number of warning messages because of the missing initial checkpoint.

attached a proposed patch with some updates tests that also check the 'async' reference to the checkpoints.

[~jukkaz] can you take a look?

> Hourly async reindexing on an idle instance
> -------------------------------------------
>
>                 Key: OAK-1877
>                 URL: https://issues.apache.org/jira/browse/OAK-1877
>             Project: Jackrabbit Oak
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 1.0
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>            Priority: Critical
>             Fix For: 1.0.1, 1.1
>
>         Attachments: updates-without-indexed-changes.patch
>
>
> OAK-1292 introduced the following interesting but not very nice behavior:
> On an idle system with no changes for an extended amount of time, the OAK-1292 change blocks the async indexer from updating the reference to the last indexed checkpoint. After one hour (the default checkpoint lifetime), the referenced checkpoint will expire, and the indexer will fall back to full reindexing.
> The result of this behavior is that once every hour, the size of an idle instance will grow with dozens or hundreds of megabytes of new index data generated by reindexing. Older index data becomes garbage, but the compaction code from OAK-1804 is needed to make it collectable. A better solution would be to prevent the reindexing from happening in the first place.



--
This message was sent by Atlassian JIRA
(v6.2#6252)