You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/07 21:32:42 UTC

[jira] [Created] (LUCENE-5503) Trivial fixes to WeightedSpanTermExtractor

Tim Allison created LUCENE-5503:
-----------------------------------

             Summary: Trivial fixes to WeightedSpanTermExtractor
                 Key: LUCENE-5503
                 URL: https://issues.apache.org/jira/browse/LUCENE-5503
             Project: Lucene - Core
          Issue Type: Bug
          Components: modules/highlighter
    Affects Versions: 5.0
            Reporter: Tim Allison
            Priority: Minor
         Attachments: LUCENE-5503.patch

The conversion of PhraseQuery to SpanNearQuery miscalculates the slop if there are stop words in some cases.  The issue only really appears if there is more than one intervening run of stop words: ab the cd the the ef.

I also noticed that the inOrder determination is based on the newly calculated slop, and it should probably be based on the original phraseQuery.getSlop()

patch and unit tests on way



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org