You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2021/06/10 15:04:00 UTC

[jira] [Commented] (LUCENE-9935) Bulk merges for stored fields when index sorting is enabled

    [ https://issues.apache.org/jira/browse/LUCENE-9935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17360996#comment-17360996 ] 

ASF subversion and git services commented on LUCENE-9935:
---------------------------------------------------------

Commit 54fb21e862c2041cb907517ed993c8ece898cb26 in lucene's branch refs/heads/main from Nhat Nguyen
[ https://gitbox.apache.org/repos/asf?p=lucene.git;h=54fb21e ]

LUCENE-9935: Enable bulk-merge for term vectors with index sort (#140)

This change enables bulk-merge for term vectors with index sort. The 
algorithm used here is similar to the one that is used to merge stored
fields.

Relates #134

> Bulk merges for stored fields when index sorting is enabled
> -----------------------------------------------------------
>
>                 Key: LUCENE-9935
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9935
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Assignee: Nhat Nguyen
>            Priority: Minor
>             Fix For: 8.9, 9.0
>
>          Time Spent: 4h 20m
>  Remaining Estimate: 0h
>
> Today stored fields disable bulk merges entirely when index sorting is enabled. However when sorting by low-cardinality fields or when the index sort is correlated with the order in which documents get indexed, we could likely still have efficient bulk merges.
> For instance, if you are merging two segments that are sorted on a field that can only take 2 values, one could bulk merge the first half of the first segment, then the first half of the second segment, then the second half of the first segment, and finally the second half of the second segment.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org