You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Pradeep Kumar K <pr...@robosoftin.com> on 2002/07/09 08:39:24 UTC
Tokenization
Hi All
In Lucene the tokenization of Sentences is happening word wise not
letter wise, So when we search for a letter which contained in a
sentence it will search for a word like the like the letter we entered.
Example : sentence "life is a big stage where we are actors" If I
search for life or any other word lucene seems to be returning correct
results. But if I search for "l" or 's' it will return no results.
I am not sure whether there are any methods which tokenizes the sentence
into letters. If anybody knows please put it in the mailing list
Best wishes
Pradeep
--------------------------------------------------------------
Robosoft Technologies - Partners in Product Development
--
To unsubscribe, e-mail: <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>
Re: Tokenization
Posted by Peter Carlson <ca...@bookandhammer.com>.
Using a wildcard search might be useful for you.
So searching for "l*" will find life.
I hope this helps.
--Peter
On 7/8/02 11:39 PM, "Pradeep Kumar K" <pr...@robosoftin.com> wrote:
>
> Hi All
>
> In Lucene the tokenization of Sentences is happening word wise not
> letter wise, So when we search for a letter which contained in a
> sentence it will search for a word like the like the letter we entered.
> Example : sentence "life is a big stage where we are actors" If I
> search for life or any other word lucene seems to be returning correct
> results. But if I search for "l" or 's' it will return no results.
>
> I am not sure whether there are any methods which tokenizes the sentence
> into letters. If anybody knows please put it in the mailing list
>
> Best wishes
> Pradeep
>
>
>
> --------------------------------------------------------------
> Robosoft Technologies - Partners in Product Development
>
>
>
> --
> To unsubscribe, e-mail: <ma...@jakarta.apache.org>
> For additional commands, e-mail: <ma...@jakarta.apache.org>
>
>
--
To unsubscribe, e-mail: <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>