You are viewing a plain text version of this content. The canonical link for it is here.

Posted to oak-issues@jackrabbit.apache.org by "Thomas Mueller (JIRA)" <ji...@apache.org> on 2014/09/09 09:18:28 UTC

[jira] [Commented] (OAK-2082) Analyze repository growth with Lucene index on SegmentMk

    [ https://issues.apache.org/jira/browse/OAK-2082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14126716#comment-14126716 ] 

Thomas Mueller commented on OAK-2082:
-------------------------------------

Some numbers from a test repository I have (not compacted):

* 7 million segments in 3 tar files, of which are
* 4.3 million (146 GB) data segments, and
* 2.7 million (187 GB) binary segments.

To get those numbers, I have a list of entries of the tar files. I extracted the number of segments and sizes using "grep", "wc -l", "awk".

> Analyze repository growth with Lucene index on SegmentMk
> --------------------------------------------------------
>
>                 Key: OAK-2082
>                 URL: https://issues.apache.org/jira/browse/OAK-2082
>             Project: Jackrabbit Oak
>          Issue Type: Task
>          Components: run
>            Reporter: Chetan Mehrotra
>            Assignee: Chetan Mehrotra
>            Priority: Minor
>             Fix For: 1.1
>
>
> As discussed in [1] we should analyze repository growth along with Lucene index usage with various combinations
> # Default setup
> # SegmentMK + FileDataStore
> # SegmentMK + External Lucene Index usage
> [1] http://markmail.org/thread/s75ksd6gs4fhmghk



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)