You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Joe Calderon <ca...@gmail.com> on 2010/02/04 19:33:32 UTC

fuzzy matching / configurable distance function?

is it possible to configure the distance formula used by fuzzy
matching? i see there are other under the function query page under
strdist but im wondering if they are applicable to fuzzy matching

thx much


--joe

RE: fuzzy matching / configurable distance function?

Posted by Fuad Efendi <fu...@efendi.ca>.
Levenstein algo is currently hardcoded (FuzzyTermEnum class) in Lucene 2.9.1
and 3.0...
There are samples of other distance in "contrib" folder
If you want to play with distance, check
http://issues.apache.org/jira/browse/LUCENE-2230
It works if distance is integer and follows "metric space axioms":
D(a,b)=D(b,a)
D(a,b)+D(b,c)>=D(a,c)


Probably SOLR can provide more freedom with plugged-in distances...

-Fuad


> -----Original Message-----
> From: Joe Calderon [mailto:calderon.joe@gmail.com]
> Sent: February-04-10 2:34 PM
> To: solr-user@lucene.apache.org
> Subject: fuzzy matching / configurable distance function?
> 
> is it possible to configure the distance formula used by fuzzy
> matching? i see there are other under the function query page under
> strdist but im wondering if they are applicable to fuzzy matching
> 
> thx much
> 
> 
> --joe