You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Tomas Mikenda <tm...@email.cz> on 2003/06/24 17:42:40 UTC
German search anomalies
Hi all,
I have following problem. I am using lucene 1.3 rc1 (for 1.2 it is even worse), so I have German analyze which maps not only ä -> a but also ae -> a. But still result are strange in PrefixQuery search. It seams to me that in that case are not german letters stemmed. See example:
köln -> 2 results (Köln; Köln Troisdorf)
koln -> 2 results ( - // - )
koeln -> 2 results ( - // - )
köln* -> 1 result (Köln/Bonn/Airport)
koln* -> 0
koeln* -> 0
Indexed was exactly words in backets. Queries were done in small letters.
Can somebody give me an advice how to get always three results for those six queries?
Regards,
Tomas
________________________________________________________________________________
Aktuálně: Digitální foťáky a kamery za akční ceny v termínu od 16. do 25.6. na http://www.email.cz/obchodnidum
________________________________________________________________________________
Aktuálně: Digitální foťáky a kamery za akční ceny v termínu od 16. do 25.6. na http://www.email.cz/obchodnidum
---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org
Re: German search anomalies
Posted by Otis Gospodnetic <ot...@yahoo.com>.
This may be the answer:
http://www.jguru.com/faq/view.jsp?EID=538312
--- Tomas Mikenda <tm...@email.cz> wrote:
> Hi all,
>
> I have following problem. I am using lucene 1.3 rc1 (for 1.2 it is
> even worse), so I have German analyze which maps not only � -> a but
> also ae -> a. But still result are strange in PrefixQuery search. It
> seams to me that in that case are not german letters stemmed. See
> example:
>
> k�ln -> 2 results (K�ln; K�ln Troisdorf)
> koln -> 2 results ( - // - )
> koeln -> 2 results ( - // - )
> k�ln* -> 1 result (K�ln/Bonn/Airport)
> koln* -> 0
> koeln* -> 0
>
> Indexed was exactly words in backets. Queries were done in small
> letters.
> Can somebody give me an advice how to get always three results for
> those six queries?
>
> Regards,
> Tomas
>
>
>
________________________________________________________________________________
> Aktu�ln�: Digit�ln� fo��ky a kamery za ak�n� ceny v term�nu od 16. do
> 25.6. na http://www.email.cz/obchodnidum
>
>
>
________________________________________________________________________________
> Aktu�ln�: Digit�ln� fo��ky a kamery za ak�n� ceny v term�nu od 16. do
> 25.6. na http://www.email.cz/obchodnidum
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>
__________________________________
Do you Yahoo!?
SBC Yahoo! DSL - Now only $29.95 per month!
http://sbc.yahoo.com
---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org