You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "David Smiley (JIRA)" <ji...@apache.org> on 2016/11/30 19:59:58 UTC

[jira] [Created] (LUCENE-7578) UnifiedHighlighter: Convert PhraseHelper to use SpanCollector API

David Smiley created LUCENE-7578:
------------------------------------

             Summary: UnifiedHighlighter: Convert PhraseHelper to use SpanCollector API
                 Key: LUCENE-7578
                 URL: https://issues.apache.org/jira/browse/LUCENE-7578
             Project: Lucene - Core
          Issue Type: Improvement
          Components: modules/highlighter
            Reporter: David Smiley


The PhraseHelper of the UnifiedHighlighter currently collects position-spans per SpanQuery (and it knows which terms are in which SpanQuery), and then it filters PostingsEnum based on that.  It's similar to how the original Highlighter WSTE works.  The main problem with this approach is that it can be inaccurate for some nested span queries -- LUCENE-2287, LUCENE-5455 (has the clearest example), LUCENE-6796.  Non-nested SpanQueries (e.g. that which is converted from a PhraseQuery or MultiPhraseQuery) are _not_ a problem.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org