You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oak-issues@jackrabbit.apache.org by "Amrit Verma (Jira)" <ji...@apache.org> on 2021/05/17 08:26:00 UTC

[jira] [Updated] (OAK-9434) MongoDB indexing: implement parallel chunk download

     [ https://issues.apache.org/jira/browse/OAK-9434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Amrit Verma updated OAK-9434:
-----------------------------
    Description: 
In case of large indexes, indexing takes a long time. In case of MongoDB Document store, Currently it is a two step process - download the data from mongodb then create index based on that data.

If something fails during this process, indexing needs to be restarted from beginning of the download step. We should make the indexing process resumable from the point it stopped. 

Since data download from mongodb seems to be more time taking than indexing itself, we first focus on download part. 

This Jira issue is for implementing resumable/parallel download.

> MongoDB indexing: implement parallel chunk download
> ---------------------------------------------------
>
>                 Key: OAK-9434
>                 URL: https://issues.apache.org/jira/browse/OAK-9434
>             Project: Jackrabbit Oak
>          Issue Type: Improvement
>          Components: indexing
>    Affects Versions: 1.38.0
>            Reporter: Amrit Verma
>            Assignee: Amrit Verma
>            Priority: Major
>
> In case of large indexes, indexing takes a long time. In case of MongoDB Document store, Currently it is a two step process - download the data from mongodb then create index based on that data.
> If something fails during this process, indexing needs to be restarted from beginning of the download step. We should make the indexing process resumable from the point it stopped. 
> Since data download from mongodb seems to be more time taking than indexing itself, we first focus on download part. 
> This Jira issue is for implementing resumable/parallel download.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)