You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Steven Pannell <st...@zooplus.com> on 2006/01/18 09:26:53 UTC
SoundEx
Hi,
Does anyone know if there is a SoundEx analyzer implementation available for
lucene???
thanks.
Steve,
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: SoundEx
Posted by Erik Hatcher <er...@ehatchersolutions.com>.
On Jan 18, 2006, at 3:26 AM, Steven Pannell wrote:
> Does anyone know if there is a SoundEx analyzer implementation
> available for
> lucene???
There is a Metaphone filter and some sample analyzers that use it in
the Lucene in Action codebase. I toyed with various ones from
Jakarta Commons Codec and thought Metaphone did a better job than
Soundex. The implementation is pretty trivial, so it'd be easy to
switch to Soundex. Just grab the code download from http://
www.lucenebook.com
Erik
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: SoundEx
Posted by ms...@aol.com.
You can also look at Phonetix which has many implementations of this...
-----Original Message-----
From: Erik Hatcher <er...@ehatchersolutions.com>
To: java-user@lucene.apache.org
Sent: Wed, 18 Jan 2006 05:41:30 -0500
Subject: Re: SoundEx
On Jan 18, 2006, at 4:20 AM, Christian Reuschling wrote:
> yes, look at the 'contributions' link at the lucene-homepage.
> The 'Phonetix'-project provides an implementation for soudex,
> metaphor and double-metaphor. Simply use their analyzer. I am
> not sure what the behaviour is in the case of wildcards. Have
> anyone an answer?
You can always index the original term in the same position as an encoded (soundex, metaphone, etc) term, or into a different field.
To be really clever, I suppose, you'd want to enumerate all terms that match the wildcard expression, then look up the ones that "sound like" those.
Erik
>
> regards
>
> Christian
>
>
> Steven Pannell wrote:
>> Hi,
>> Does anyone know if there is a SoundEx analyzer implementation >> available for
>> lucene???
>> thanks.
>> Steve,
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: SoundEx
Posted by Erik Hatcher <er...@ehatchersolutions.com>.
On Jan 18, 2006, at 4:20 AM, Christian Reuschling wrote:
> yes, look at the 'contributions' link at the lucene-homepage.
> The 'Phonetix'-project provides an implementation for soudex,
> metaphor and double-metaphor. Simply use their analyzer. I am
> not sure what the behaviour is in the case of wildcards. Have
> anyone an answer?
You can always index the original term in the same position as an
encoded (soundex, metaphone, etc) term, or into a different field.
To be really clever, I suppose, you'd want to enumerate all terms
that match the wildcard expression, then look up the ones that "sound
like" those.
Erik
>
> regards
>
> Christian
>
>
> Steven Pannell wrote:
>> Hi,
>> Does anyone know if there is a SoundEx analyzer implementation
>> available for
>> lucene???
>> thanks.
>> Steve,
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: SoundEx
Posted by Christian Reuschling <re...@dfki.de>.
yes, look at the 'contributions' link at the lucene-homepage.
The 'Phonetix'-project provides an implementation for soudex,
metaphor and double-metaphor. Simply use their analyzer. I am
not sure what the behaviour is in the case of wildcards. Have
anyone an answer?
regards
Christian
Steven Pannell wrote:
> Hi,
>
> Does anyone know if there is a SoundEx analyzer implementation available for
> lucene???
>
> thanks.
> Steve,
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org