Posted to dev@lucene.apache.org by "David Smiley (JIRA)" <ji...@apache.org> on 2014/12/31 23:01:13 UTC

[jira] [Commented] (LUCENE-627) highlighter problems with overlapping tokens

    [ https://issues.apache.org/jira/browse/LUCENE-627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14262458#comment-14262458 ] 

David Smiley commented on LUCENE-627:
-------------------------------------

[~ryguasu] HighlighterTest was affected by r422302.

I think that _either_ {{<B>Hi</B>-<B>Speed</B>10 foo}} _or_ {{<B>Hi-Speed</B>10 foo}} is an acceptable result for a boolean query on "hi" and "speed".
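
To make the two acceptable outputs concrete, here is a minimal sketch of offset-based grouping. It is not Lucene's Highlighter code. The function name highlight, the (term, start, end) token tuples, the character offsets, and the catenated "hi-speed" token are all assumptions made up for illustration. Tokens whose offsets overlap are merged into one group, and the group's span of the original text is emitted exactly once, wrapped in tags if any token in the group matched the query.

```python
def highlight(text, tokens, query_terms, pre="<B>", post="</B>"):
    """Highlight text given tokens as (term, start, end) character offsets.

    Illustrative sketch only: overlapping tokens are merged into one group
    so that no part of the original text is ever emitted twice.
    """
    # Sort by start offset; on ties put the widest token first.
    ordered = sorted(tokens, key=lambda t: (t[1], -t[2]))

    groups = []  # each entry: [start, end, matched]
    for term, start, end in ordered:
        hit = term in query_terms
        if groups and start < groups[-1][1]:
            # Token overlaps the current group: widen the group instead
            # of starting a new one.
            group = groups[-1]
            group[1] = max(group[1], end)
            group[2] = group[2] or hit
        else:
            groups.append([start, end, hit])

    out = []
    last = 0
    for start, end, hit in groups:
        out.append(text[last:start])           # untouched text between groups
        span = text[start:end]                 # original text, emitted once
        out.append(pre + span + post if hit else span)
        last = end
    out.append(text[last:])
    return "".join(out)


# Hypothetical token streams for the examples discussed in this issue.
split_tokens = [("hi", 0, 2), ("speed", 3, 8), ("10", 8, 10), ("foo", 11, 14)]
with_catenation = split_tokens + [("hi-speed", 0, 8)]
ipod_tokens = [("i", 0, 1), ("pod", 1, 4), ("ipod", 0, 4)]

print(highlight("Hi-Speed10 foo", split_tokens, {"hi", "speed"}))
print(highlight("Hi-Speed10 foo", with_catenation, {"hi", "speed"}))
print(highlight("iPod", ipod_tokens, set()))
```

With only the split tokens, this prints {{<B>Hi</B>-<B>Speed</B>10 foo}}. With the extra overlapping token it prints {{<B>Hi-Speed</B>10 foo}}. For the iPod tokens from the issue description, with nothing matched, it prints iPod unchanged rather than duplicating text.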

> highlighter problems with overlapping tokens
> --------------------------------------------
>
>                 Key: LUCENE-627
>                 URL: https://issues.apache.org/jira/browse/LUCENE-627
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: core/other
>    Affects Versions: 2.1
>            Reporter: Yonik Seeley
>             Fix For: 2.1
>
>         Attachments: Highlighter.java.diff, highlight_overlap.diff
>
>
> The Lucene highlighter has problems when analysis generates overlapping tokens.
> For example, if analysis of iPod generates the tokens "i", "pod", "ipod" (with pod and ipod in the same position),
> then the highlighter will output this as iipod, regardless of whether any of those tokens are highlighted.
> Discovered via http://issues.apache.org/jira/browse/SOLR-24


