You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oak-issues@jackrabbit.apache.org by "Julian Reschke (Jira)" <ji...@apache.org> on 2022/12/16 16:05:00 UTC

[jira] [Commented] (OAK-9790) Implement parallel indexing for speeding up oak run indexing command

    [ https://issues.apache.org/jira/browse/OAK-9790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17648707#comment-17648707 ] 

Julian Reschke commented on OAK-9790:
-------------------------------------

trunk: [04146eeb19|https://github.com/apache/jackrabbit-oak/commit/04146eeb19bc2f24e29b169fd52f55bac85fce8b] (1.46.0) [ed3936bbac|https://github.com/apache/jackrabbit-oak/commit/ed3936bbacee98920eb39835dee7209c7b49f9c7]


> Implement parallel indexing for speeding up oak run indexing command
> --------------------------------------------------------------------
>
>                 Key: OAK-9790
>                 URL: https://issues.apache.org/jira/browse/OAK-9790
>             Project: Jackrabbit Oak
>          Issue Type: Story
>            Reporter: Jun Zhang
>            Assignee: Amit Jain
>            Priority: Major
>             Fix For: 1.46.0
>
>
> Implement parallel indexing for speeding up oak run indexing command
> Since indexing was single threads, which is slow for large repository. In order to improve the indexing performance we need to implement parallel indexing.
> The work is cover for both lucene and elastic indexing. In order to support parallel indexing, it need to split the big flat file store file ahead, which add a big overhead, but make parallel index possible and much faster.
> Another change together is support the LZ4 compression since which is much faster compare to gzip.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)