You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Vadim Kisselmann <v....@googlemail.com> on 2011/10/20 15:46:53 UTC

LUCENE-2208 (SOLR-1883) Bug with HTMLStripCharFilter, given patch in next nightly build?

Hello folks,

i have big problems with InvalidTokenOffsetExceptions with highlighting.
Looks like a bug in HTMLStripCharFilter.

H.Wang added a patch in LUCENE-2208, but nobody have time to look at this.
Could someone of the committers please take a look at this patch and commit
it or is this problem more complicated as i think? :)
Thanks guys...

Best Regards
Vadim

Re: LUCENE-2208 (SOLR-1883) Bug with HTMLStripCharFilter, given patch in next nightly build?

Posted by Vadim Kisselmann <v....@googlemail.com>.
UPDATE:
i checked out the latest trunk-version and patched this with the patch from
LUCENE-2208.
This patch seems not to work. Or i had done something wrong.

My old log snippets:

Http - 500 Internal Server Error
Error: Carrot2 clustering failed........

And this was caused by:
Http - 500 Internal Server Error
Error: org.apache.lucene.search.highlight.InvalidTokenOffsetsException:
Token the exceeds length of provided text sized 41

Best Regards
Vadim





2011/10/20 Vadim Kisselmann <v....@googlemail.com>

> Hello folks,
>
> i have big problems with InvalidTokenOffsetExceptions with highlighting.
> Looks like a bug in HTMLStripCharFilter.
>
> H.Wang added a patch in LUCENE-2208, but nobody have time to look at this.
> Could someone of the committers please take a look at this patch and commit
> it or is this problem more complicated as i think? :)
> Thanks guys...
>
> Best Regards
> Vadim
>
>
>