You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Tomas Mikenda <tm...@email.cz> on 2003/06/24 17:42:40 UTC

German search anomalies

Hi all,

I have following problem. I am using lucene 1.3 rc1 (for 1.2 it is even worse), so I have German analyze which maps not only ä -> a but also ae -> a. But still result are strange in PrefixQuery search. It seams to me that in that case are not german letters stemmed. See example:

köln  -> 2 results (Köln; Köln Troisdorf)
koln  -> 2 results ( - // - )
koeln -> 2 results ( - // - )
köln* -> 1 result (Köln/Bonn/Airport)
koln* -> 0
koeln* -> 0

Indexed was exactly words in backets. Queries were done in small letters.
Can somebody give me an advice how to get always three results for those six queries?

Regards,
Tomas


________________________________________________________________________________
Aktuálně: Digitální foťáky a kamery za akční ceny v termínu od 16. do 25.6. na http://www.email.cz/obchodnidum


________________________________________________________________________________
Aktuálně: Digitální foťáky a kamery za akční ceny v termínu od 16. do 25.6. na http://www.email.cz/obchodnidum


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Re: German search anomalies

Posted by Otis Gospodnetic <ot...@yahoo.com>.
This may be the answer:
http://www.jguru.com/faq/view.jsp?EID=538312

--- Tomas Mikenda <tm...@email.cz> wrote:
> Hi all,
> 
> I have following problem. I am using lucene 1.3 rc1 (for 1.2 it is
> even worse), so I have German analyze which maps not only � -> a but
> also ae -> a. But still result are strange in PrefixQuery search. It
> seams to me that in that case are not german letters stemmed. See
> example:
> 
> k�ln  -> 2 results (K�ln; K�ln Troisdorf)
> koln  -> 2 results ( - // - )
> koeln -> 2 results ( - // - )
> k�ln* -> 1 result (K�ln/Bonn/Airport)
> koln* -> 0
> koeln* -> 0
> 
> Indexed was exactly words in backets. Queries were done in small
> letters.
> Can somebody give me an advice how to get always three results for
> those six queries?
> 
> Regards,
> Tomas
> 
> 
>
________________________________________________________________________________
> Aktu�ln�: Digit�ln� fo��ky a kamery za ak�n� ceny v term�nu od 16. do
> 25.6. na http://www.email.cz/obchodnidum
> 
> 
>
________________________________________________________________________________
> Aktu�ln�: Digit�ln� fo��ky a kamery za ak�n� ceny v term�nu od 16. do
> 25.6. na http://www.email.cz/obchodnidum
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
> 


__________________________________
Do you Yahoo!?
SBC Yahoo! DSL - Now only $29.95 per month!
http://sbc.yahoo.com

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org