You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Marc Morissette (JIRA)" <ji...@apache.org> on 2018/06/20 02:48:00 UTC

[jira] [Commented] (LUCENE-8365) ArrayIndexOutOfBoundsException in UnifiedHighlighter

    [ https://issues.apache.org/jira/browse/LUCENE-8365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16517729#comment-16517729 ] 

Marc Morissette commented on LUCENE-8365:
-----------------------------------------

The fix is in Github

> ArrayIndexOutOfBoundsException in UnifiedHighlighter
> ----------------------------------------------------
>
>                 Key: LUCENE-8365
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8365
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: modules/highlighter
>    Affects Versions: 7.3.1
>            Reporter: Marc Morissette
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> We see ArrayIndexOutOfBoundsExceptions coming out of the UnifiedHighlighter in our production logs from time to time:
> {code}
> java.lang.ArrayIndexOutOfBoundsException
> 	at java.base/java.lang.System.arraycopy(Native Method)
> 	at org.apache.lucene.search.uhighlight.PhraseHelper$SpanCollectedOffsetsEnum.add(PhraseHelper.java:386)
> 	at org.apache.lucene.search.uhighlight.PhraseHelper$OffsetSpanCollector.collectLeaf(PhraseHelper.java:341)
> 	at org.apache.lucene.search.spans.TermSpans.collect(TermSpans.java:121)
> 	at org.apache.lucene.search.spans.NearSpansOrdered.collect(NearSpansOrdered.java:149)
> 	at org.apache.lucene.search.spans.NearSpansUnordered.collect(NearSpansUnordered.java:171)
> 	at org.apache.lucene.search.spans.FilterSpans.collect(FilterSpans.java:120)
> 	at org.apache.lucene.search.uhighlight.PhraseHelper.createOffsetsEnumsForSpans(PhraseHelper.java:261)
> ...
> {code}
> It turns out that there is an "off by one" error in the UnifiedHighlighter's code that, as far as I can tell, is only triggered when two nested SpanNearQueries contain the same term.
> The resulting behaviour depends on the content of the highlighted document. Either, some highlighted terms go missing or an ArrayIndexOutOfBoundsException is thrown.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org