You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Sean O'Connor <se...@oconeco.com> on 2005/09/07 07:34:18 UTC

Re: Hits document offset information? Span query or Surround? - thanks

Thanks for the input. I am looking at the suggested links now. If I make 
any progress I will return to see if any of my work would be appropriate 
to contribute back.

Sean


Paul Elschot wrote:

>On Tuesday 06 September 2005 08:52, markharw00d wrote:
>  
>
>> >>I believe I have heard that Span queries provide some way to access 
>>document offset information for their hits somehow.
>>
>>See http://marc.theaimsgroup.com/?l=lucene-user&m=112496111224218&w=2
>>
>>Faithfully selecting extracts based *exactly* on query criteria will be 
>>hard given complex queries eg with nested Boolean logic.
>>
>>The current highlighter matches based on ANY query terms found in the 
>>provided doc text
>>The proposal above matches based on any spans/phrases/terms
>>
>>Both options still fail to take into account any boolean logic and show 
>>the real basis for the match eg the query
>>    (author:"Doug Cutting"AND title:"Lucene in Action") OR (author:Erik 
>>AND author:Otis)
>>would still highlight references to "Doug Cutting" and "Lucene In 
>>Action" for the LIA book, despite the fact that the match was actually 
>>for Erik and Otis (the true authors).
>>For most people this is a problem they can live with.
>>    
>>
>
>The person who solves that might also write a SpanAndQuery :)
>
>Regards,
>Paul Elschot
>
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
>  
>



---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org