You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by "Natarajan, Rajeswari" <ra...@sap.com> on 2018/02/20 16:16:04 UTC

solr.DictionaryCompoundWordTokenFilterFactory filter and double quotes

 Hi,

We have below field type defined in our schema.xml  to support  the German Compound  word search .  This works find. But even when double quotes are there in the search term , it gets split . Is there a way not to split the term when double quotes are present in the query with this field type

<fieldType name="text_de_compoundword" class="solr.TextField" positionIncrementGap="100">
             <analyzer>
                  <tokenizer class="solr.StandardTokenizerFactory"/>
                  <filter class="solr.LowerCaseFilterFactory" />
                  <!-- Case insensitive stop word removal. add enablePositionIncrements=true
                        in both the index and query analyzers to leave a 'gap' for more accurate
                        phrase queries. -->
                <filter class="solr.StopFilterFactory" ignoreCase="true"
                        words="lang/de/stopwords.txt" format="snowball"
                        enablePositionIncrements="true"/>
                 <filter class="solr.SnowballPorterFilterFactory" language="German2"/>                 
                 <filter class="solr.DictionaryCompoundWordTokenFilterFactory" dictionary="lang/de/dictionary.txt"
                  minWordSize="5" minSubwordSize="2" maxSubwordSize="15" onlyLongestMatch="true"/>
             </analyzer>
        </fieldType>  


Thanks in Advance,
Rajeswari