Posted to commits@lucene.apache.org by to...@apache.org on 2020/02/29 09:16:18 UTC
[lucene-solr] branch branch_8x updated: SOLR-14295: Add the parameter description about 'discardCompoundToken' for JapaneseTokenizer in RefGuide
This is an automated email from the ASF dual-hosted git repository.
tomoko pushed a commit to branch branch_8x
in repository https://gitbox.apache.org/repos/asf/lucene-solr.git
The following commit(s) were added to refs/heads/branch_8x by this push:
new 3ab908a SOLR-14295: Add the parameter description about 'discardCompoundToken' for JapaneseTokenizer in RefGuide
new 983b5e9 Merge branch 'branch_8x' of github.com:apache/lucene-solr into branch_8x
3ab908a is described below
commit 3ab908afc0804abfee387973dc757d783d18fa9d
Author: Tomoko Uchida <to...@apache.org>
AuthorDate: Sat Feb 29 17:43:25 2020 +0900
SOLR-14295: Add the parameter description about 'discardCompoundToken' for JapaneseTokenizer in RefGuide
---
solr/solr-ref-guide/src/language-analysis.adoc | 2 ++
1 file changed, 2 insertions(+)
diff --git a/solr/solr-ref-guide/src/language-analysis.adoc b/solr/solr-ref-guide/src/language-analysis.adoc
index d82773f..fe1e1bc 100644
--- a/solr/solr-ref-guide/src/language-analysis.adoc
+++ b/solr/solr-ref-guide/src/language-analysis.adoc
@@ -1279,6 +1279,8 @@ For some applications it might be good to use `search` mode for indexing and `no
`discardPunctuation`:: set to `false` to keep punctuation, `true` to discard (the default)
+`discardCompoundToken`:: set to `false` to keep original compound tokens with the `search` mode, `true` to discard.
+
==== Japanese Base Form Filter
Replaces original terms' text with the corresponding base form (lemma). (`JapaneseTokenizer` annotates each term with its base form.)