You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@lucy.apache.org by goran kent <go...@gmail.com> on 2011/10/04 14:24:19 UTC

[lucy-user] Possible highlighter bug when searching for duplicate [wordX wordX]?

Hi,

I stumbled onto this one when searching for [business to business].

Source <TITLE>: ...Companies, Products, Trade Leads, Business Marketplace

'to' is a stopword which is ignored - no problem.
So the query then becomes [business business].  The highlighter then produces:

...Companies, Products, Trade Leads, <strong>Business</strong>
<strong>Marketp</strong>lace

I then spent some time chasing my tail trying to reduce things down to
a small reproducible unit, and finally decided to try searching for
any duplicate [wordX wordX], and sure enough it's reproducible with
all my indexes.

Is this a bug or a problem on my side?

regards

Re: [lucy-user] Possible highlighter bug when searching for duplicate [wordX wordX]?

Posted by Marvin Humphrey <ma...@rectangular.com>.
On Tue, Oct 04, 2011 at 02:24:19PM +0200, goran kent wrote:
> ...Companies, Products, Trade Leads, <strong>Business</strong>
> <strong>Marketp</strong>lace

This has now been fixed both on both the main line and the 0.2.x branch.  It
will be in our next release.

Marvin Humphrey


Re: [lucy-user] Possible highlighter bug when searching for duplicate [wordX wordX]?

Posted by Marvin Humphrey <ma...@rectangular.com>.
On Tue, Oct 04, 2011 at 02:24:19PM +0200, goran kent wrote:

> ...Companies, Products, Trade Leads, <strong>Business</strong>
> <strong>Marketp</strong>lace

> Is this a bug or a problem on my side?

Definitely a bug in the loop of Highlighter_highlight_excerpt() in
Highlighter.c.  Please open a ticket.

Thanks for the clear explanation.

Marvin Humphrey