Posted to java-user@lucene.apache.org by Dragon Fly <dr...@hotmail.com> on 2006/04/21 19:48:45 UTC

Synonyms ...

Hi,

What is the best way to implement the following?

Document 1 contains the following text:
  "THE CZECH REPUBLIC ORGANIZATION"

Document 2 contains the following text:
  "THE CZE ORGANISATION"

Synonym rules:
  (1) CZECH REPUBLIC --> CZE
  (2) CZE --> CZECH REPUBLIC
  (3) ORGANIZATION --> ORG, ORGANISATION

All of the following phrase searches must match BOTH documents:
  "CZECH REPUBLIC ORGANIZATION"
  "CZECH REPUBLIC ORGANISATION"
  "CZECH REPUBLIC ORG"
  "CZE ORGANIZATION"
  "CZE ORGANISATION"
  "CZE ORG"

I don't think the SynonymAnalyzer described in LIA would work because
some of my "synonyms" contain multiple words.  Thank you.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Re: Synonyms ...

Posted by Yonik Seeley <ys...@gmail.com>.
On 4/21/06, Dragon Fly <dr...@hotmail.com> wrote:
> I don't think the SynonymAnalyzer described in LIA would work because
> some of my "synonyms" contain multiple words.

The SynonymFilter in Solr can handle multi-word synonyms.

http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters
http://incubator.apache.org/solr/docs/api/org/apache/solr/analysis/SynonymFilter.html

-Yonik
http://incubator.apache.org/solr Solr, the open-source Lucene search server
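
To illustrate the general idea behind multi-word synonym handling, here is a
minimal sketch in plain Java. It is not Solr's SynonymFilter and uses no Lucene
or Solr API. It assumes a simpler approach: every synonym phrase is reduced to
one canonical token, and the same reduction is applied to both the document
text and the phrase query, so the original rules become bidirectional. The
class name, method names, and rule table are hypothetical, and tokenizing on
whitespace is an assumption made for brevity.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration only; not part of Lucene or Solr.
final class SynonymSketch {
    // Each phrase (as a list of tokens) maps to one canonical token.
    private static final Map<List<String>, String> RULES = new HashMap<>();
    // Length in tokens of the longest phrase in RULES.
    private static final int MAX_PHRASE_LEN = 2;

    static {
        RULES.put(Arrays.asList("CZECH", "REPUBLIC"), "CZE");
        RULES.put(Arrays.asList("ORGANISATION"), "ORGANIZATION");
        RULES.put(Arrays.asList("ORG"), "ORGANIZATION");
    }

    // Split on whitespace, upper-case, and replace every known phrase with
    // its canonical token, trying the longest phrase first at each position.
    static List<String> normalize(String text) {
        List<String> raw =
                Arrays.asList(text.trim().toUpperCase(Locale.ROOT).split("\\s+"));
        List<String> out = new ArrayList<>();
        int i = 0;
        while (i < raw.size()) {
            boolean replaced = false;
            int longest = Math.min(MAX_PHRASE_LEN, raw.size() - i);
            for (int len = longest; len >= 1; len--) {
                String canonical = RULES.get(raw.subList(i, i + len));
                if (canonical != null) {
                    out.add(canonical);
                    i += len;
                    replaced = true;
                    break;
                }
            }
            if (!replaced) {
                out.add(raw.get(i));
                i++;
            }
        }
        return out;
    }

    // A phrase query matches when its normalized tokens occur as a
    // contiguous run inside the normalized document tokens.
    static boolean phraseMatches(String document, String phraseQuery) {
        return Collections.indexOfSubList(
                normalize(document), normalize(phraseQuery)) >= 0;
    }
}
```

Under these assumptions both example documents normalize to the tokens
THE, CZE, ORGANIZATION, and each of the six phrase queries normalizes to
CZE, ORGANIZATION, so every query matches both documents. In a real Lucene
setup the same reduction would have to run inside the analyzer used at both
index time and query time.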
