You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-commits@lucene.apache.org by Apache Wiki <wi...@apache.org> on 2010/10/12 00:31:23 UTC
[Solr Wiki] Trivial Update of "AnalyzersTokenizersTokenFilters" by RobertMuir
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.
The "AnalyzersTokenizersTokenFilters" page has been changed by RobertMuir.
The comment on this change is: asciifoldingfilter example and add caverphone to phonetics.
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters?action=diff&rev1=91&rev2=92
--------------------------------------------------
Converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if one exists.
+ {{{
+ <filter class="solr.ASCIIFoldingFilterFactory"/>
+ }}}
+
See the [[http://lucene.apache.org/java/2_9_1/api/all/org/apache/lucene/analysis/ASCIIFoldingFilter.html|ASCIIFoldingFilter Javadocs]] for more details.
<<Anchor(PhoneticFilterFactory)>>
@@ -511, +515 @@
Creates `org.apache.solr.analysis.PhoneticFilter`.
- Uses [[http://jakarta.apache.org/commons/codec/|commons codec]] to generate phonetically similar tokens. This currently supports [[http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/package-summary.html|four methods]].
+ Uses [[http://jakarta.apache.org/commons/codec/|commons codec]] to generate phonetically similar tokens. This currently supports [[http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/package-summary.html|five methods]].
||'''arg''' ||'''value''' ||
- ||encoder ||one of: [[http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/DoubleMetaphone.html|DoubleMetaphone]], [[http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/Metaphone.html|Metaphone]], [[http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/Soundex.html|Soundex]], [[http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/RefinedSoundex.html|RefinedSoundex]] ||
+ ||encoder ||one of: [[http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/DoubleMetaphone.html|DoubleMetaphone]], [[http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/Metaphone.html|Metaphone]], [[http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/Soundex.html|Soundex]], [[http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/RefinedSoundex.html|RefinedSoundex]], [[http://jakarta.apache.org/commons/codec/api-release/org/apache/commons/codec/language/Caverphone.html|Caverphone]] <!> [[Solr3.1]] ||
||inject ||true/false -- true will add tokens to the stream, false will replace the existing token ||
||maxCodeLength ||integer -- sets the maximum length of the code to be generated. Supported only for Metaphone and !DoubleMetaphone encodings ||