Posted to oak-issues@jackrabbit.apache.org by "Marcel Reutegger (JIRA)" <ji...@apache.org> on 2015/02/02 11:06:34 UTC

[jira] [Commented] (OAK-2110) performance issues with VersionGarbageCollector

    [ https://issues.apache.org/jira/browse/OAK-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14301095#comment-14301095 ] 

Marcel Reutegger commented on OAK-2110:
---------------------------------------

I measured the time it takes to fetch the documents marked with {{_deletedOnce}} using the generic implementation and compared it with the mongo-specific one. The mongo-specific version is roughly ten times faster. I think this is mostly because it executes the query only once on MongoDB and streams the entire result to Oak, whereas the generic implementation has to issue multiple queries to fetch the matching documents in batches.

To solve this, we will probably have to change the signature of DocumentStore.query() or introduce a new method.
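
To illustrate the difference between the two access patterns, here is a minimal, self-contained sketch. It does not use Oak's DocumentStore API: InMemoryStore, query(fromKey, limit), queryAll() and the queryCount counter are hypothetical stand-ins, and the counter only simulates round trips to the backend. It is not a benchmark, just a way to see how the number of queries grows with the batched approach while the streamed approach needs a single query.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class DeletedOnceScan {

    /** Hypothetical stand-in for a document store; NOT Oak's DocumentStore. */
    static class InMemoryStore {
        // document id -> value of the _deletedOnce flag
        private final TreeMap<String, Boolean> docs = new TreeMap<>();
        // number of simulated round trips to the backend
        int queryCount = 0;

        void put(String id, boolean deletedOnce) {
            docs.put(id, deletedOnce);
        }

        /** Batched style: at most 'limit' matching ids with id > fromKey. */
        List<String> query(String fromKey, int limit) {
            queryCount++;
            List<String> result = new ArrayList<>();
            for (Map.Entry<String, Boolean> e : docs.tailMap(fromKey, false).entrySet()) {
                if (result.size() >= limit) {
                    break;
                }
                if (e.getValue()) {
                    result.add(e.getKey());
                }
            }
            return result;
        }

        /** Streaming style: one query, matches are consumed lazily. */
        Iterator<String> queryAll() {
            queryCount++;
            return docs.entrySet().stream()
                    .filter(e -> e.getValue())
                    .map(e -> e.getKey())
                    .iterator();
        }
    }

    /** Collects all matching ids by paging through the store in batches. */
    static List<String> collectBatched(InMemoryStore store, int batchSize) {
        List<String> ids = new ArrayList<>();
        String from = ""; // assumes all ids sort after the empty string
        while (true) {
            List<String> batch = store.query(from, batchSize);
            ids.addAll(batch);
            if (batch.size() < batchSize) {
                break; // a short batch means there is nothing left
            }
            from = batch.get(batch.size() - 1);
        }
        return ids;
    }

    /** Collects all matching ids from a single streamed query. */
    static List<String> collectStreamed(InMemoryStore store) {
        List<String> ids = new ArrayList<>();
        store.queryAll().forEachRemaining(ids::add);
        return ids;
    }
}
```

Both methods return the same ids; only the number of queries differs. With the batched variant the query count grows with the number of matching documents divided by the batch size, which is the overhead a changed query() signature or a new streaming method would avoid.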

> performance issues with VersionGarbageCollector
> -----------------------------------------------
>
>                 Key: OAK-2110
>                 URL: https://issues.apache.org/jira/browse/OAK-2110
>             Project: Jackrabbit Oak
>          Issue Type: Bug
>          Components: mongomk
>            Reporter: Julian Reschke
>             Fix For: 1.1.7
>
>
> This one currently special-cases Mongo. For other persistences, it
> - fetches *all* documents
> - filters by SD_TYPE
> - filters by lastmod of versions
> - deletes what remains
> This is not only inefficient but also fails with OutOfMemory for any larger repo.


