Posted to commits@lucene.apache.org by ct...@apache.org on 2018/01/31 18:57:59 UTC
lucene-solr:branch_7x: SOLR-11835: Adjust Ukranian language example
Repository: lucene-solr
Updated Branches:
refs/heads/branch_7x 505f7b9d5 -> 70a9e5b3f
SOLR-11835: Adjust Ukranian language example
Project: http://git-wip-us.apache.org/repos/asf/lucene-solr/repo
Commit: http://git-wip-us.apache.org/repos/asf/lucene-solr/commit/70a9e5b3
Tree: http://git-wip-us.apache.org/repos/asf/lucene-solr/tree/70a9e5b3
Diff: http://git-wip-us.apache.org/repos/asf/lucene-solr/diff/70a9e5b3
Branch: refs/heads/branch_7x
Commit: 70a9e5b3f58d06f1a7616a9b2c7f9681a7ecd5eb
Parents: 505f7b9
Author: Cassandra Targett <ct...@apache.org>
Authored: Wed Jan 31 12:56:50 2018 -0600
Committer: Cassandra Targett <ct...@apache.org>
Committed: Wed Jan 31 12:57:50 2018 -0600
----------------------------------------------------------------------
solr/solr-ref-guide/src/language-analysis.adoc | 4 +---
1 file changed, 1 insertion(+), 3 deletions(-)
----------------------------------------------------------------------
http://git-wip-us.apache.org/repos/asf/lucene-solr/blob/70a9e5b3/solr/solr-ref-guide/src/language-analysis.adoc
----------------------------------------------------------------------
diff --git a/solr/solr-ref-guide/src/language-analysis.adoc b/solr/solr-ref-guide/src/language-analysis.adoc
index 9d00325..8d6f734 100644
--- a/solr/solr-ref-guide/src/language-analysis.adoc
+++ b/solr/solr-ref-guide/src/language-analysis.adoc
@@ -1767,11 +1767,9 @@ Lucene also includes an example Ukrainian stopword list, in the `lucene-analyzer
<analyzer>
<tokenizer class="solr.StandardTokenizerFactory"/>
<filter class="solr.StopFilterFactory" words="org/apache/lucene/analysis/uk/stopwords.txt"/>
- <filter class="solr.MorfologikFilterFactory" dictionary="org/apache/lucene/analysis/uk/ukrainian.dict"/>
<filter class="solr.LowerCaseFilterFactory"/>
+ <filter class="solr.MorfologikFilterFactory" dictionary="org/apache/lucene/analysis/uk/ukrainian.dict"/>
</analyzer>
----
-Note the lower case filter is applied _after_ the Morfologik stemmer; this is because the Ukrainian dictionary contains proper names and then proper term case may be important to resolve disambiguities (or even lookup the correct lemma at all).
-
The Morfologik `dictionary` param value is a constant specifying which dictionary to choose. The dictionary resource must be named `path/to/_language_.dict` and have an associated `.info` metadata file. See http://morfologik.blogspot.com/[the Morfologik project] for details. If the dictionary attribute is not provided, the Polish dictionary is loaded and used by default.