Posted to solr-user@lucene.apache.org by "Natarajan, Rajeswari" <ra...@sap.com> on 2018/02/20 16:16:04 UTC
solr.DictionaryCompoundWordTokenFilterFactory filter and double quotes
Hi,
We have the field type below defined in our schema.xml to support German compound-word search. This works fine. However, even when the search term is enclosed in double quotes, it still gets split. Is there a way to keep the term from being split when double quotes are present in a query against this field type?
<fieldType name="text_de_compoundword" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory" />
    <!-- Case insensitive stop word removal. add enablePositionIncrements=true
         in both the index and query analyzers to leave a 'gap' for more accurate
         phrase queries. -->
    <filter class="solr.StopFilterFactory" ignoreCase="true"
            words="lang/de/stopwords.txt" format="snowball"
            enablePositionIncrements="true"/>
    <filter class="solr.SnowballPorterFilterFactory" language="German2"/>
    <filter class="solr.DictionaryCompoundWordTokenFilterFactory" dictionary="lang/de/dictionary.txt"
            minWordSize="5" minSubwordSize="2" maxSubwordSize="15" onlyLongestMatch="true"/>
  </analyzer>
</fieldType>
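To make the question concrete, here is a minimal sketch of how a field might use this type. The field name title_de and the example term are hypothetical, chosen only for illustration, and the splitting assumes the subwords are present in lang/de/dictionary.txt.

<!-- Hypothetical field declaration (the name "title_de" is an assumption,
     not taken from our real schema). It uses the field type defined above. -->
<field name="title_de" type="text_de_compoundword" indexed="true" stored="true"/>
<!-- Example of the kind of quoted query in question:
       title_de:"Fussballspiel"
     The single analyzer above is applied at both index and query time,
     so the quoted term also passes through the compound word filter. -->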
Thanks in advance,
Rajeswari