You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Nina Khosravi <kh...@us.ibm.com> on 2007/07/23 06:38:20 UTC

CJKAnalyzer - Issues with scoring


Hello,

I am in desperate need of help on a problem with our newly deployed
application that is now using the CJKAnalyzer/CJKTokenizer.  Our previous
release was using the WhitespaceAnalyzer and the users were extremely happy
with the search results ordered by the score of the documents.  We switched
to the CJKAnalyzer and now the scoring is not giving them the results they
are expecting.   Any ideas on why using the CJKAnalyzer would be so
different?  Some documents that contain more instances of a search term are
appearing at a lower position in the search results than those having one
instance of the search term.

I tried printing the explanation of the scoring but don't get any
information, just the value 0.0 for each doc returned.  Anyone have any
ideas?  I have not dug deep into this problem.  I am pressed for time so
was hoping someone could provide some guidance.

Thanks!!

Nina

Re: CJKAnalyzer - Issues with scoring

Posted by Nina Khosravi <kh...@us.ibm.com>.
Hello,

I actually found an error in my code.  Thanks to anyone taking a look at
this but the way I was setting the "score" sort field had an error.  Sorry
about this.

Thanks,
Nina

Re: CJKAnalyzer - Issues with scoring

Posted by Nina Khosravi <kh...@us.ibm.com>.
Sure.  It is one of the contributed analyzers.

The Java docs are here:
http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/javadoc/index.html.

Also, the source files are  here:
http://svn.apache.org/viewvc/lucene/java/trunk/contrib/analyzers/src/java/org/apache/lucene/analysis/cjk/

Thanks,
Nina



                                                                           
             "Dmitry"                                                      
             <dmitrytkach1@hot                                             
             mail.com>                                                  To 
                                       <ja...@lucene.apache.org>       
             07/22/2007 10:44                                           cc 
             PM                                                            
                                                                   Subject 
                                       Re: CJKAnalyzer - Issues with       
             Please respond to         scoring                             
             java-user@lucene.                                             
                apache.org                                                 
                                                                           
                                                                           
                                                                           
                                                                           




Nina,
can you point me to the link where I can get documentation about
CJKAnalyzer.
thanks,
DT
www.ejinz.com
Search Engine Economy

----- Original Message -----
From: "Nina Khosravi" <kh...@us.ibm.com>
To: <ja...@lucene.apache.org>
Sent: Sunday, July 22, 2007 11:38 PM
Subject: CJKAnalyzer - Issues with scoring




Hello,

I am in desperate need of help on a problem with our newly deployed
application that is now using the CJKAnalyzer/CJKTokenizer.  Our previous
release was using the WhitespaceAnalyzer and the users were extremely happy
with the search results ordered by the score of the documents.  We switched
to the CJKAnalyzer and now the scoring is not giving them the results they
are expecting.   Any ideas on why using the CJKAnalyzer would be so
different?  Some documents that contain more instances of a search term are
appearing at a lower position in the search results than those having one
instance of the search term.

I tried printing the explanation of the scoring but don't get any
information, just the value 0.0 for each doc returned.  Anyone have any
ideas?  I have not dug deep into this problem.  I am pressed for time so
was hoping someone could provide some guidance.

Thanks!!

Nina


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Re: CJKAnalyzer - Issues with scoring

Posted by Dmitry <dm...@hotmail.com>.
Nina,
can you point me to the link where I can get documentation about 
CJKAnalyzer.
thanks,
DT
www.ejinz.com
Search Engine Economy

----- Original Message ----- 
From: "Nina Khosravi" <kh...@us.ibm.com>
To: <ja...@lucene.apache.org>
Sent: Sunday, July 22, 2007 11:38 PM
Subject: CJKAnalyzer - Issues with scoring




Hello,

I am in desperate need of help on a problem with our newly deployed
application that is now using the CJKAnalyzer/CJKTokenizer.  Our previous
release was using the WhitespaceAnalyzer and the users were extremely happy
with the search results ordered by the score of the documents.  We switched
to the CJKAnalyzer and now the scoring is not giving them the results they
are expecting.   Any ideas on why using the CJKAnalyzer would be so
different?  Some documents that contain more instances of a search term are
appearing at a lower position in the search results than those having one
instance of the search term.

I tried printing the explanation of the scoring but don't get any
information, just the value 0.0 for each doc returned.  Anyone have any
ideas?  I have not dug deep into this problem.  I am pressed for time so
was hoping someone could provide some guidance.

Thanks!!

Nina


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org