Posted to solr-user@lucene.apache.org by ao...@hispeed.ch on 2014/03/20 16:38:13 UTC

understand debuginfo from query

I want to simplify the debug info so that the user can see why a doc was found.

Below is the output for a doc:

0.085597195 = (MATCH) sum of:
  0.083729245 = (MATCH) max of:
    0.0019158133 = (MATCH) weight(plain_text:test^10.0 in 601) [DefaultSimilarity], result of:
      0.0019158133 = score(doc=601,freq=9.0 = termFreq=9.0
), product of:
        0.022560213 = queryWeight, product of:
          10.0 = boost
          3.6232536 = idf(docFreq=81, maxDocs=1130)
          6.2265067E-4 = queryNorm
        0.084920004 = fieldWeight in 601, product of:
          3.0 = tf(freq=9.0), with freq of:
            9.0 = termFreq=9.0
          3.6232536 = idf(docFreq=81, maxDocs=1130)
          0.0078125 = fieldNorm(doc=601)
    0.083729245 = (MATCH) weight(inhaltstyp:test^6.0 in 601) [DefaultSimilarity], result of:
      0.083729245 = score(doc=601,freq=1.0 = termFreq=1.0
), product of:
        0.017686278 = queryWeight, product of:
          6.0 = boost
          4.734136 = idf(docFreq=26, maxDocs=1130)
          6.2265067E-4 = queryNorm
        4.734136 = fieldWeight in 601, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          4.734136 = idf(docFreq=26, maxDocs=1130)
          1.0 = fieldNorm(doc=601)
    0.013458222 = (MATCH) weight(title:test^20.0 in 601) [DefaultSimilarity], result of:
      0.013458222 = score(doc=601,freq=1.0 = termFreq=1.0
), product of:
        0.042281017 = queryWeight, product of:
          20.0 = boost
          3.395244 = idf(docFreq=102, maxDocs=1130)
          6.2265067E-4 = queryNorm
        0.31830412 = fieldWeight in 601, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          3.395244 = idf(docFreq=102, maxDocs=1130)
          0.09375 = fieldNorm(doc=601)
  0.001867952 = (MATCH) product of:
    0.003735904 = (MATCH) sum of:
      0.003735904 = (MATCH) ConstantScore(expiration:[1395328539325 TO *]), product of:
        1.0 = boost
        0.003735904 = queryNorm
    0.5 = coord(1/2)
  0.0 = (MATCH) FunctionQuery(div(int(clicks),max(int(displays),const(1)))), product of:
    0.0 = div(int(clicks)=0,max(int(displays)=432,const(1)))
    8.0 = boost
    6.2265067E-4 = queryNorm 


Why is the sum 0.085597195? That would mean 0.083729245 + 0.001867952, and these values are not included in the sum: 0.0019158133 + 0.013458222 + 0.003735904.

Am I looking at the wrong total?
Aren't these the two cases I have to sum up: "x = (MATCH) sum of" and "x = score("?
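
For reference, the top-level number can be reproduced by following the node labels in the explain tree instead of adding every leaf. The following is a minimal sketch in Python with the values copied from the output above (the variable names are only illustrative): "max of" keeps the single best field score, "product of" scales the expiration clause by coord, and "sum of" adds the three top-level clauses.

```python
import math

# Per-field scores under the "max of" node, copied from the explain output.
dismax_clauses = {
    "plain_text": 0.0019158133,
    "inhaltstyp": 0.083729245,
    "title": 0.013458222,
}
constant_score = 0.003735904  # ConstantScore(expiration:[...])
coord = 0.5                   # coord(1/2)
function_query = 0.0          # div(clicks, max(displays, 1)) with clicks=0

# "max of": only the highest-scoring field contributes to the parent node.
best_field, dismax_score = max(dismax_clauses.items(), key=lambda kv: kv[1])

# "product of": the inner sum is multiplied by coord.
filter_score = constant_score * coord

# "sum of": the three top-level clauses are added together.
total = dismax_score + filter_score + function_query

print(best_field, round(total, 9))
```

So the plain_text and title scores are listed but dropped by the "max of" node, and 0.003735904 enters only after being halved by coord. The result agrees with the reported 0.085597195 up to the last digit, presumably because the explain values are 32-bit floats.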

I'm trying to extract the fields that were used for weighting the doc.
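
One possible way to pull the matched fields out of the textual explain, sketched in Python. The regular expression and the function name `matched_fields` are assumptions made for this example and only cover term queries of the form weight(field:term^boost in doc) as they appear above; other query types would need their own patterns.

```python
import re

# Shortened copy of the explain output (only the weight(...) lines matter here).
EXPLAIN = """\
0.085597195 = (MATCH) sum of:
  0.083729245 = (MATCH) max of:
    0.0019158133 = (MATCH) weight(plain_text:test^10.0 in 601) [DefaultSimilarity], result of:
    0.083729245 = (MATCH) weight(inhaltstyp:test^6.0 in 601) [DefaultSimilarity], result of:
    0.013458222 = (MATCH) weight(title:test^20.0 in 601) [DefaultSimilarity], result of:
"""

# Captures score, field, term and the optional ^boost of each weight(...) line.
WEIGHT_LINE = re.compile(
    r"^\s*(?P<score>[0-9.eE+-]+) = \(MATCH\) "
    r"weight\((?P<field>\w+):(?P<term>[^\s^)]+)(?:\^(?P<boost>[0-9.]+))? in \d+\)",
    re.MULTILINE,
)

def matched_fields(explain_text):
    """Return (field, term, boost, score) per weight(...) line, best score first."""
    rows = [
        (m["field"], m["term"], float(m["boost"] or 1.0), float(m["score"]))
        for m in WEIGHT_LINE.finditer(explain_text)
    ]
    return sorted(rows, key=lambda row: row[3], reverse=True)

for field, term, boost, score in matched_fields(EXPLAIN):
    print(f"{field}: matched '{term}' (boost {boost}, score {score})")
```

Solr also accepts the request parameter debug.explain.structured=true, which returns the explain as nested data instead of text and may be easier to walk than parsing lines like this.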


Re: understand debuginfo from query

Posted by Jack Krupansky <ja...@basetechnology.com>.
How are you getting the impression that it is not worth anything? It appears 
to be worth more than the inhaltstyp field, but less than the title field - as 
you have specified by your field boosts.
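
To see how the boosts interact with the other factors, the per-field scores can be recomputed from the numbers in the explain output. This is a small Python sketch that follows the "product of" labels shown there (queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm); the names `fields` and `term_score` are made up for the example.

```python
import math

QUERY_NORM = 6.2265067e-4  # queryNorm from the explain output

# (boost, idf, tf, fieldNorm) per field, copied from the explain output.
fields = {
    "plain_text": (10.0, 3.6232536, 3.0, 0.0078125),
    "inhaltstyp": (6.0, 4.734136, 1.0, 1.0),
    "title": (20.0, 3.395244, 1.0, 0.09375),
}

def term_score(boost, idf, tf, field_norm, query_norm=QUERY_NORM):
    """score = queryWeight * fieldWeight, as labelled in the explain tree."""
    query_weight = boost * idf * query_norm
    field_weight = tf * idf * field_norm
    return query_weight * field_weight

scores = {name: term_score(*parts) for name, parts in fields.items()}

for name, score in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {score:.9f}")
```

With these inputs the boost ordering (title 20, plain_text 10, inhaltstyp 6) does not carry over to the final scores: inhaltstyp ends up highest because its fieldNorm is 1.0, while plain_text is pulled down by a fieldNorm of 0.0078125, which in DefaultSimilarity typically reflects a long field.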

-- Jack Krupansky

-----Original Message----- 
From: aowen@hispeed.ch
Sent: Friday, March 21, 2014 4:59 AM
To: solr-user@lucene.apache.org
Subject: Re: understand debuginfo from query

I found a good page that explains the debug output, but it is still unclear 
to me. Why is the field plain_text not worth anything? The query term was 
found 3 times.

you can see it here: http://explain.solr.pl/explains/a90aze3o




Re: understand debuginfo from query

Posted by ao...@hispeed.ch.
I found a good page that explains the debug output, but it is still unclear to me. Why is the field plain_text not worth anything? The query term was found 3 times.

you can see it here: http://explain.solr.pl/explains/a90aze3o
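
A note on the "3 times": the explain output in the first message lists termFreq=9.0 and tf=3.0 for plain_text, so the 3 is the tf value, not the raw count. Assuming the stock DefaultSimilarity formulas (tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1))), the tf and idf values in the output can be reproduced with this short Python sketch:

```python
import math

def tf(freq):
    """DefaultSimilarity term-frequency factor: sqrt(freq)."""
    return math.sqrt(freq)

def idf(doc_freq, max_docs):
    """DefaultSimilarity inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

print(tf(9.0))         # plain_text: termFreq=9.0
print(idf(81, 1130))   # plain_text
print(idf(26, 1130))   # inhaltstyp
print(idf(102, 1130))  # title
```

A custom similarity configured in the schema would change these formulas, so this only applies where the explain shows [DefaultSimilarity], as it does here.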


