You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by pi...@gmx.de on 2003/09/12 10:23:53 UTC

endOffset, startOffset of Token

Are the endOffset, startOffset fields of a Token used in 
proximity search and phrase search.

My problem is that I tried my own TokenStream and may have
messed up the offsets. What are the restrictions for a phrase
search to work? Do the offsets of tokens have to be exactly
adjacent?

  Thanks,
  Harald.

-- 
COMPUTERBILD 15/03: Premium-e-mail-Dienste im Test
--------------------------------------------------
1. GMX TopMail - Platz 1 und Testsieger!
2. GMX ProMail - Platz 2 und Preis-Qualitätssieger!
3. Arcor - 4. web.de - 5. T-Online - 6. freenet.de - 7. daybyday - 8. e-Post


Re: endOffset, startOffset of Token

Posted by Doug Cutting <cu...@lucene.com>.
pifpafpuf@gmx.de wrote:
> Are the endOffset, startOffset fields of a Token used in 
> proximity search and phrase search?

No.  There are not used by indexing or search.  Their intent is only to 
aid the extraction of matching text snippets when displaying results.

Doug