You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@lucene.apache.org by ct...@apache.org on 2018/01/31 18:57:59 UTC

lucene-solr:branch_7x: SOLR-11835: Adjust Ukranian language example

Repository: lucene-solr
Updated Branches:
  refs/heads/branch_7x 505f7b9d5 -> 70a9e5b3f


SOLR-11835: Adjust Ukranian language example


Project: http://git-wip-us.apache.org/repos/asf/lucene-solr/repo
Commit: http://git-wip-us.apache.org/repos/asf/lucene-solr/commit/70a9e5b3
Tree: http://git-wip-us.apache.org/repos/asf/lucene-solr/tree/70a9e5b3
Diff: http://git-wip-us.apache.org/repos/asf/lucene-solr/diff/70a9e5b3

Branch: refs/heads/branch_7x
Commit: 70a9e5b3f58d06f1a7616a9b2c7f9681a7ecd5eb
Parents: 505f7b9
Author: Cassandra Targett <ct...@apache.org>
Authored: Wed Jan 31 12:56:50 2018 -0600
Committer: Cassandra Targett <ct...@apache.org>
Committed: Wed Jan 31 12:57:50 2018 -0600

----------------------------------------------------------------------
 solr/solr-ref-guide/src/language-analysis.adoc | 4 +---
 1 file changed, 1 insertion(+), 3 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/lucene-solr/blob/70a9e5b3/solr/solr-ref-guide/src/language-analysis.adoc
----------------------------------------------------------------------
diff --git a/solr/solr-ref-guide/src/language-analysis.adoc b/solr/solr-ref-guide/src/language-analysis.adoc
index 9d00325..8d6f734 100644
--- a/solr/solr-ref-guide/src/language-analysis.adoc
+++ b/solr/solr-ref-guide/src/language-analysis.adoc
@@ -1767,11 +1767,9 @@ Lucene also includes an example Ukrainian stopword list, in the `lucene-analyzer
 <analyzer>
   <tokenizer class="solr.StandardTokenizerFactory"/>
   <filter class="solr.StopFilterFactory" words="org/apache/lucene/analysis/uk/stopwords.txt"/>
-  <filter class="solr.MorfologikFilterFactory" dictionary="org/apache/lucene/analysis/uk/ukrainian.dict"/>
   <filter class="solr.LowerCaseFilterFactory"/>
+  <filter class="solr.MorfologikFilterFactory" dictionary="org/apache/lucene/analysis/uk/ukrainian.dict"/>
 </analyzer>
 ----
 
-Note the lower case filter is applied _after_ the Morfologik stemmer; this is because the Ukrainian dictionary contains proper names and then proper term case may be important to resolve disambiguities (or even lookup the correct lemma at all).
-
 The Morfologik `dictionary` param value is a constant specifying which dictionary to choose. The dictionary resource must be named `path/to/_language_.dict` and have an associated `.info` metadata file. See http://morfologik.blogspot.com/[the Morfologik project] for details. If the dictionary attribute is not provided, the Polish dictionary is loaded and used by default.