You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Pradeep Kumar K <pr...@robosoftin.com> on 2002/07/09 08:39:24 UTC

Tokenization

Hi All

In Lucene the tokenization of Sentences is happening word wise not 
letter wise, So when we search for a letter which contained in a 
sentence it will search for a word like the like the letter we entered. 
Example :  sentence "life is a big stage where we are actors" If I 
search for life or any other word lucene seems to be returning correct 
results. But if I search for  "l" or 's' it will return no results.

I am not sure whether there are any methods which tokenizes the sentence 
into letters. If  anybody knows please put it in the mailing list

Best wishes
Pradeep



--------------------------------------------------------------
Robosoft Technologies - Partners in Product Development



--
To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>


Re: Tokenization

Posted by Peter Carlson <ca...@bookandhammer.com>.
Using a wildcard search might be useful for you.

So searching for "l*" will find life.

I hope this helps.
--Peter


On 7/8/02 11:39 PM, "Pradeep Kumar K" <pr...@robosoftin.com> wrote:

> 
> Hi All
> 
> In Lucene the tokenization of Sentences is happening word wise not
> letter wise, So when we search for a letter which contained in a
> sentence it will search for a word like the like the letter we entered.
> Example :  sentence "life is a big stage where we are actors" If I
> search for life or any other word lucene seems to be returning correct
> results. But if I search for  "l" or 's' it will return no results.
> 
> I am not sure whether there are any methods which tokenizes the sentence
> into letters. If  anybody knows please put it in the mailing list
> 
> Best wishes
> Pradeep
> 
> 
> 
> --------------------------------------------------------------
> Robosoft Technologies - Partners in Product Development
> 
> 
> 
> --
> To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
> For additional commands, e-mail: <ma...@jakarta.apache.org>
> 
> 


--
To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>