Posted to oak-issues@jackrabbit.apache.org by "Thomas Mueller (JIRA)" <ji...@apache.org> on 2014/09/09 09:18:28 UTC
[jira] [Commented] (OAK-2082) Analyze repository growth with Lucene index on SegmentMk
[ https://issues.apache.org/jira/browse/OAK-2082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14126716#comment-14126716 ]
Thomas Mueller commented on OAK-2082:
-------------------------------------
Some numbers from a test repository I have (not compacted):
* 7 million segments in 3 tar files, of which
* 4.3 million (146 GB) are data segments, and
* 2.7 million (187 GB) are binary segments.
To get those numbers, I listed the entries of the tar files and extracted the segment counts and sizes from that list using "grep", "wc -l", and "awk".
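The same counting could be sketched in Python instead of shell tools. This is only an illustration under stated assumptions, not the commands actually used: it assumes a hypothetical listing with one "<size in bytes> <entry name>" pair per line, and it assumes a segment's kind can be read from the first hex digit of the fourth UUID group in the entry name ("a" for data, "b" for binary). Both the listing format and the helper names (classify_by_uuid, summarize) are made up for the example.

```python
from collections import defaultdict


def classify_by_uuid(name):
    """Guess the segment kind from an entry name that starts with a UUID.

    ASSUMPTION: the first hex digit of the fourth UUID group marks the kind
    ('a' = data, 'b' = binary). Swap in your own rule if that does not hold.
    """
    groups = name.split("-")
    if len(groups) < 5 or not groups[3]:
        return "unknown"
    return {"a": "data", "b": "binary"}.get(groups[3][0].lower(), "unknown")


def summarize(lines, classify=classify_by_uuid):
    """Return {kind: (segment_count, total_bytes)}.

    ASSUMPTION: each line has the hypothetical form '<size> <entry name>'.
    Blank or malformed lines are skipped.
    """
    totals = defaultdict(lambda: [0, 0])
    for line in lines:
        parts = line.split()
        if len(parts) != 2 or not parts[0].isdigit():
            continue
        kind = classify(parts[1])
        totals[kind][0] += 1              # like "grep ... | wc -l"
        totals[kind][1] += int(parts[0])  # like "awk '{s += $1} END {print s}'"
    return {kind: tuple(pair) for kind, pair in totals.items()}
```

Calling summarize(open("entries.txt")) on such a listing (the file name is a placeholder) would return one (count, bytes) pair per segment kind, which can then be compared against the numbers above.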
> Analyze repository growth with Lucene index on SegmentMk
> --------------------------------------------------------
>
> Key: OAK-2082
> URL: https://issues.apache.org/jira/browse/OAK-2082
> Project: Jackrabbit Oak
> Issue Type: Task
> Components: run
> Reporter: Chetan Mehrotra
> Assignee: Chetan Mehrotra
> Priority: Minor
> Fix For: 1.1
>
>
> As discussed in [1], we should analyze repository growth along with Lucene index usage for various combinations:
> # Default setup
> # SegmentMK + FileDataStore
> # SegmentMK + External Lucene Index usage
> [1] http://markmail.org/thread/s75ksd6gs4fhmghk