You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Steve Davids (JIRA)" <ji...@apache.org> on 2014/02/19 14:24:21 UTC
[jira] [Updated] (LUCENE-5455) Nested SpanNear queries lose
positional highlights
[ https://issues.apache.org/jira/browse/LUCENE-5455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Steve Davids updated LUCENE-5455:
---------------------------------
Attachment: LUCENE-5455-Tests.patch
Attached a patch including various test cases that demonstrates the problem.
> Nested SpanNear queries lose positional highlights
> --------------------------------------------------
>
> Key: LUCENE-5455
> URL: https://issues.apache.org/jira/browse/LUCENE-5455
> Project: Lucene - Core
> Issue Type: Bug
> Components: modules/highlighter
> Affects Versions: 4.3.1, 4.6.1
> Reporter: Steve Davids
> Fix For: 4.8, 5.0
>
> Attachments: LUCENE-5455-Tests.patch
>
>
> Given text of: "x y z x z x a"
> With a query of: spanNear([spanNear([text:x, text:y, text:z], 0, true), text:a], 10, false)
> Resulting highlight: <B>x</B> <B>y</B> <B>z</B> <B>x</B> <B>z</B> <B>x</B> <B>a</B>
> Expected highlight: <B>x</B> <B>y</B> <B>z</B> x z x <B>a</B>
> This is caused because WeightedSpanTermExtractor.extractWeightedSpanTerms takes the SpanQuery and flattens all terms and uses the positions from the outermost SpanNear clause (ignoring the nested SpanNear positions). I believe this could be resolved with a little recursion - walking the span query tree in the extractWeightedSpanTerms method.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org