You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-commits@lucene.apache.org by Apache Wiki <wi...@apache.org> on 2011/03/15 23:44:17 UTC

[Solr Wiki] Update of "AnalyzersTokenizersTokenFilters" by PaulLibbrecht

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "AnalyzersTokenizersTokenFilters" page has been changed by PaulLibbrecht.
The comment on this change is: added how to get the analyzer from a component.
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters?action=diff&rev1=110&rev2=111

--------------------------------------------------

  When a document is indexed, its individual fields are subject to the analyzing and tokenizing filters that can transform and normalize the data in the fields. For example — removing blank spaces, removing html code, stemming, removing a particular character and replacing it with another. At indexing time as well as at query time you may need to do some of the above or similiar operations. For example, you might perform a [[http://en.wikipedia.org/wiki/Soundex|Soundex]] transformation (a type of phonic hashing) on a string to enable a search based upon the word and upon its 'sound-alikes'.
  
  The lists below provide an overview of '''''some''''' of the more heavily used Tokenizers and !TokenFilters provided by Solr "out of the box" along with tips/examples of using them.  '''This list should by no means be considered the "complete" list of all Analysis classes available in Solr!'''  In addition to new classes being added on an ongoing basis, you can load your own custom Analysis code as a [[SolrPlugins|Plugin]].
+ 
+ Analyzers, per field, are configured in the [[Schema]] and can be accessed by the search components using the [[http://lucene.apache.org/solr/api/org/apache/solr/handler/component/ResponseBuilder.html|ResponseBuilder]]'s getSchema method which can give the field-type by field-name, hence the analyzer by field-name.
  
  For a more complete list of what Tokenizers and !TokenFilters come out of the box, please consult the [[http://lucene.apache.org/solr/api/org/apache/solr/analysis/package-summary.html|javadocs]] for the analysis package.  if you have any tips/tricks you'd like to mention about using any of these classes, please add them below.