You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Sean O'Connor <se...@oconeco.com> on 2005/09/07 07:34:18 UTC
Re: Hits document offset information? Span query or Surround? - thanks
Thanks for the input. I am looking at the suggested links now. If I make
any progress I will return to see if any of my work would be appropriate
to contribute back.
Sean
Paul Elschot wrote:
>On Tuesday 06 September 2005 08:52, markharw00d wrote:
>
>
>> >>I believe I have heard that Span queries provide some way to access
>>document offset information for their hits somehow.
>>
>>See http://marc.theaimsgroup.com/?l=lucene-user&m=112496111224218&w=2
>>
>>Faithfully selecting extracts based *exactly* on query criteria will be
>>hard given complex queries eg with nested Boolean logic.
>>
>>The current highlighter matches based on ANY query terms found in the
>>provided doc text
>>The proposal above matches based on any spans/phrases/terms
>>
>>Both options still fail to take into account any boolean logic and show
>>the real basis for the match eg the query
>> (author:"Doug Cutting"AND title:"Lucene in Action") OR (author:Erik
>>AND author:Otis)
>>would still highlight references to "Doug Cutting" and "Lucene In
>>Action" for the LIA book, despite the fact that the match was actually
>>for Erik and Otis (the true authors).
>>For most people this is a problem they can live with.
>>
>>
>
>The person who solves that might also write a SpanAndQuery :)
>
>Regards,
>Paul Elschot
>
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
>
>
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org