You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "Kaival Parikh (Jira)" <ji...@apache.org> on 2022/06/09 05:52:00 UTC

[jira] [Created] (LUCENE-10606) Optimize hit collection of prefilter in KnnVectorQuery for BitSet backed queries

Kaival Parikh created LUCENE-10606:
--------------------------------------

             Summary: Optimize hit collection of prefilter in KnnVectorQuery for BitSet backed queries
                 Key: LUCENE-10606
                 URL: https://issues.apache.org/jira/browse/LUCENE-10606
             Project: Lucene - Core
          Issue Type: Improvement
          Components: core/search
            Reporter: Kaival Parikh


While working on this [PR|https://github.com/apache/lucene/pull/932] to add prefilter testing support, we saw that hit collection took a long time for BitSetIterator backed scorers (due to iteration over the entire underlying BitSet, and copying it into an internal one) (Link to [numbers|https://github.com/apache/lucene/pull/932#discussion_r888896850], second table)

These BitSetIterators can be frequent (as they are used in LRUQueryCache), and bulk collection can be optimized with more knowledge of the underlying iterator



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org