Posted to solr-user@lucene.apache.org by Shri Kanish <sh...@ymail.com> on 2012/08/01 20:19:04 UTC

Solr spellcheck for words with quotes

Hi,
I use Solr as the search engine for our application. We have a title "Pandora's star". When I send the following query:
http://localhost:8983/solr/select?q=pandora's star&spellcheck=true&spellcheck.collate=true
 
I get the response below:

<lst name="spellcheck">
  <lst name="suggestions">
    <lst name="pandora">
      <int name="numFound">1</int>
      <int name="startOffset">10</int>
      <int name="endOffset">17</int>
      <arr name="suggestion">
        <str>pandora's</str>
      </arr>
    </lst>
    <str name="collation">text_engb:pandora's's star</str>
  </lst>
</lst>
 
The word is reported as pandora, not pandora's, and an additional 's is appended to the collation result. Below is my spellcheck configuration:
 

<fieldType name="textSpell" class="solr.TextField" positionIncrementGap="100" omitNorms="true">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords_selma.txt"/>
    <filter class="solr.StandardFilterFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords_selma.txt"/>
    <filter class="solr.StandardFilterFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
  </analyzer>
</fieldType>
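
The doubled 's in the collation is consistent with the suggestion being spliced into the query purely by character offsets. The sketch below is only an illustration of that arithmetic, not Solr's actual code. It assumes the query string that reached the spellchecker was "text_engb:pandora's star" (inferred from the collation prefix and from startOffset=10); apply_suggestion is a hypothetical helper name.

```python
def apply_suggestion(query, start, end, suggestion):
    """Replace query[start:end] with suggestion and keep everything else as-is."""
    return query[:start] + suggestion + query[end:]


# Assumed query string: inferred from the collation output, not shown in the request URL.
query = "text_engb:pandora's star"

# Offsets reported in the spellcheck response for the token "pandora".
start, end = 10, 17

# The span covers only "pandora"; the trailing 's lies outside it.
token = query[start:end]

# Splicing in the suggestion "pandora's" leaves the original 's in place after it.
collation = apply_suggestion(query, start, end, "pandora's")

print(token)
print(collation)
```

Under that assumption, the offsets cover only "pandora", so the original 's stays in the string after the suggested "pandora's" is inserted, which would give the collation shown in the response.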
 
Please suggest
 
Thanks,
Shri