You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2021/07/29 06:47:00 UTC

[jira] [Commented] (LUCENE-10031) Speedup to SortedDocIDMerger when sorting on low-cardinality fields

    [ https://issues.apache.org/jira/browse/LUCENE-10031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17389289#comment-17389289 ] 

ASF subversion and git services commented on LUCENE-10031:
----------------------------------------------------------

Commit 0e6c3146d7853d27037213dc58eddc16a0e05daa in lucene's branch refs/heads/main from Adrien Grand
[ https://gitbox.apache.org/repos/asf?p=lucene.git;h=0e6c314 ]

LUCENE-10031: Speed up SortedDocIdMerger on low-cardinality sort fields. (#221)

When sorting by low-cardinality fields, the same sub remains current for long
sequences of doc IDs. This speeds up SortedDocIdMerger a bit by extracting
the sub that leads iteration.

> Speedup to SortedDocIDMerger when sorting on low-cardinality fields
> -------------------------------------------------------------------
>
>                 Key: LUCENE-10031
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10031
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Priority: Minor
>          Time Spent: 50m
>  Remaining Estimate: 0h
>
> I've been looking at profiles of indexing with index sorting enabled and saw non-negligible time spent in SortedDocIDMerger. This isn't completely surprising as this little class is called on every document whenever merging postings, doc values, stored fields, etc.
> I'm especially interested in cases when the sort key is on a low cardinality field, so the priority queue doesn't get reordered often. I've been playing with a change to SortedDocIdMerger that makes merging significantly faster in that case.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org