You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by anuvenk <an...@hotmail.com> on 2008/01/05 00:39:23 UTC

Dealing with numbers in search terms

I seem to have problems with the results i get for this search term. Not sure
if its because of the synonym mappings i have for this search term.

Search term: chapter 7

The first result doesn't even have any occurrence of chapter, bankruptcy.
But just a few occurrences of 7. But i have the 'mm' set up so that it
should match both the words if the search term has 2 words as in the case of
chapter 7.

Synonyms i have for chapter & chapter 7:

chap 7 => bankruptcy
chapter => bankruptcy
chap => chapter
chapter 7 => bankruptcy
bankrupcy => bankruptcy
chap,7,chap7,chapter 7,chapter 7 bankruptcy,chap 7

Since i'm new to solr...still learning how its working.

Here is the parsedquery_toString

<str name="parsedquery_toString">
+(text:"(bankruptci chap 7) (7 chapter chap) 7 bankruptci"^0.8 |
((name:bankruptci name:chap)^2.0))~0.01 (text:"(bankruptci chap 7) (7
chapter chap) 7 bankruptci"~50^0.8 | ((name:bankruptci name:chap)^2.0))~0.01
</str>

Here is a portion of my request handler
     <float name="tie">0.01</float>
     <str name="qf">text^0.8 name^2.0</str>
     <!-- until 3 all should match;4 - 3 shld match; 5 - 4 shld match; 6 - 5
shld match; above 6 - 90% match -->
     <str name="mm">3<-1 4<-1 5<-1 6<90%</str>
     <str name="pf">
         text^0.8 name^2.0
     </str>
     <int name="ps">50</int>

So for 'chapter 7' search term i was expecting solr to return all documents
that have both 'chapter' and '7' in the document. But its puzzling why it
was returning some documents that just have the number 7 in them.

It'll be very helpful if i can get some explanation on this behaviour. 
-- 
View this message in context: http://www.nabble.com/Dealing-with-numbers-in-search-terms-tp14627181p14627181.html
Sent from the Solr - User mailing list archive at Nabble.com.