You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "Adrien Grand (Jira)" <ji...@apache.org> on 2021/07/21 17:19:00 UTC

[jira] [Created] (LUCENE-10031) Speedup to SortedDocIDMerger when sorting on low-cardinality fields

Adrien Grand created LUCENE-10031:
-------------------------------------

             Summary: Speedup to SortedDocIDMerger when sorting on low-cardinality fields
                 Key: LUCENE-10031
                 URL: https://issues.apache.org/jira/browse/LUCENE-10031
             Project: Lucene - Core
          Issue Type: Improvement
            Reporter: Adrien Grand


I've been looking at profiles of indexing with index sorting enabled and saw non-negligible time spent in SortedDocIDMerger. This isn't completely surprising as this little class is called on every document whenever merging postings, doc values, stored fields, etc.

I'm especially interested in cases when the sort key is on a low cardinality field, so the priority queue doesn't get reordered often. I've been playing with a change to SortedDocIdMerger that makes merging significantly faster in that case.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org