You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by kirpakaro <kh...@yahoo.com> on 2012/12/21 00:29:13 UTC

Japanese exact match results do not show on top of results

Hi folks,

    I am having couple of problems with Japanese data, 1. it is not properly
indexing all the data 2. displaying the exact match result on top and then
90%match and 80%match etc. does not work.
 I am using solr3.6.1 and using text_ja as the fieldType here is the schema 


   <field name="q" type="text_ja" indexed="true" stored="true" />
   <field name="qs" type="text_general" indexed="false" stored="true"
multiValued="true"/>
   <field name="q_e" type="string" indexed="true" stored="true" />

 <copyField source="q" dest="q_e" maxChars="250"/>

what I want to achieve is that if there is an exact query match it should
provide the results from q_e followed by results from partial match from q
field and if there is nothing in q_e field then partial matches should come
from q field.  This is how I specify the query

http://localhost:7983/zoom/jp/select/?q=鹿児島
鹿児島銀行&rows=10&version=2.2&qf=query+query_exact^1&mm=90%25&pf=q^1+q_e^10
OR
&version=2.2&rows=10&qf=q+q_e^1&pf=query^10+query_exact^1

somehow the exact query matches results do not come on top, though the data
contains it. It is puzzling that all the documents do not get indexed
properly, but if I change the q field to string and q_e to text_ja then all
the records are indexed properly, but that still does not solve the problem
of exact match on top followed by partial matches.

text_ja field uses:
<filter class="solr.JapaneseBaseFormFilterFactory"/>
<filter class="solr.JapanesePartOfSpeechStopFilterFactory"
tags="../../../solr/conf/lang/stoptags_ja.txt"
enablePositionIncrements="true"/>
  <filter class="solr.CJKWidthFilterFactory"/>
<filter class="solr.StopFilterFactory" ignoreCase="true"
words="../../../solr/conf/lang/stopwords_ja.txt"
enablePositionIncrements="true" />
 <filter class="solr.JapaneseKatakanaStemFilterFactory" minimumLength="4"/>
  <filter class="solr.LowerCaseFilterFactory"/>

     How to solve this problem, 

Thanks










--
View this message in context: http://lucene.472066.n3.nabble.com/Japanese-exact-match-results-do-not-show-on-top-of-results-tp4028422.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Japanese exact match results do not show on top of results

Posted by Robert Muir <rc...@gmail.com>.
I think you are hitting solr-3589. There is a vote underway for a 3.6.2
that contains this fix
On Dec 20, 2012 6:29 PM, "kirpakaro" <kh...@yahoo.com> wrote:

> Hi folks,
>
>     I am having couple of problems with Japanese data, 1. it is not
> properly
> indexing all the data 2. displaying the exact match result on top and then
> 90%match and 80%match etc. does not work.
>  I am using solr3.6.1 and using text_ja as the fieldType here is the schema
>
>
>    <field name="q" type="text_ja" indexed="true" stored="true" />
>    <field name="qs" type="text_general" indexed="false" stored="true"
> multiValued="true"/>
>    <field name="q_e" type="string" indexed="true" stored="true" />
>
>  <copyField source="q" dest="q_e" maxChars="250"/>
>
> what I want to achieve is that if there is an exact query match it should
> provide the results from q_e followed by results from partial match from q
> field and if there is nothing in q_e field then partial matches should come
> from q field.  This is how I specify the query
>
> http://localhost:7983/zoom/jp/select/?q=鹿児島
> 鹿児島銀行&rows=10&version=2.2&qf=query+query_exact^1&mm=90%25&pf=q^1+q_e^10
> OR
> &version=2.2&rows=10&qf=q+q_e^1&pf=query^10+query_exact^1
>
> somehow the exact query matches results do not come on top, though the data
> contains it. It is puzzling that all the documents do not get indexed
> properly, but if I change the q field to string and q_e to text_ja then all
> the records are indexed properly, but that still does not solve the problem
> of exact match on top followed by partial matches.
>
> text_ja field uses:
> <filter class="solr.JapaneseBaseFormFilterFactory"/>
> <filter class="solr.JapanesePartOfSpeechStopFilterFactory"
> tags="../../../solr/conf/lang/stoptags_ja.txt"
> enablePositionIncrements="true"/>
>   <filter class="solr.CJKWidthFilterFactory"/>
> <filter class="solr.StopFilterFactory" ignoreCase="true"
> words="../../../solr/conf/lang/stopwords_ja.txt"
> enablePositionIncrements="true" />
>  <filter class="solr.JapaneseKatakanaStemFilterFactory" minimumLength="4"/>
>   <filter class="solr.LowerCaseFilterFactory"/>
>
>      How to solve this problem,
>
> Thanks
>
>
>
>
>
>
>
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Japanese-exact-match-results-do-not-show-on-top-of-results-tp4028422.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>