You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Daulet Kadirbekov (JIRA)" <ji...@apache.org> on 2014/11/28 10:17:13 UTC

[jira] [Comment Edited] (SOLR-6802) Solr AnalyticsQuery API returns analytics by documents that don't match query

    [ https://issues.apache.org/jira/browse/SOLR-6802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14228174#comment-14228174 ] 

Daulet Kadirbekov edited comment on SOLR-6802 at 11/28/14 9:16 AM:
-------------------------------------------------------------------

1)Debug output:
"debug":{
    "rawquerystring":"host:kp.kz",
    "querystring":"host:kp.kz",
    "parsedquery":"host:kp.kz",
    "parsedquery_toString":"host:kp.kz",
    "explain":{
      "http://kp.kz/video/6628-neizvestnyj-ustroil-strelbu-iz-avtomata-v-almaty":"\n1.2848014 = (MATCH) weight(host:kp.kz in 0) [DefaultSimilarity], result of:\n  1.2848014 = fieldWeight in 0, product of:\n    1.0 = tf(freq=1.0), with freq of:\n      1.0 = termFreq=1.0\n    1.2848014 = idf(docFreq=2867, maxDocs=3813)\n    1.0 = fieldNorm(doc=0)\n",
      "http://kp.kz/video/6850-stolknovenie-poezda-s-avtobusom-v-petropavlovske-snyala-kamera-videonablyudeniya":"\n1.2848014 = (MATCH) weight(host:kp.kz in 9) [DefaultSimilarity], result of:\n  1.2848014 = fieldWeight in 9, product of:\n    1.0 = tf(freq=1.0), with freq of:\n      1.0 = termFreq=1.0\n    1.2848014 = idf(docFreq=2867, maxDocs=3813)\n    1.0 = fieldNorm(doc=9)\n"
},
    "QParser":"LuceneQParser",
    "filter_queries":["{!sentimentAnalytics cost=101}",
      "{!resourceAnalytics cost=112}",
      "{!finalAnalytics cost=121}"],
    "parsed_filter_queries":["SentimentAnalyticsQuery({!cache=false cost=101})",
      "ResourceAnalyticsQuery({!cache=false cost=112})",
      "FinalAnalyticsQuery({!cache=false cost=121})"],
    "timing":{
      "time":968.0,
      "prepare":{
        "time":0.0,
        "query":{
          "time":0.0},
        "facet":{
          "time":0.0},
        "mlt":{
          "time":0.0},
        "highlight":{
          "time":0.0},
        "stats":{
          "time":0.0},
        "expand":{
          "time":0.0},
        "debug":{
          "time":0.0}},
      "process":{
        "time":968.0,
        "query":{
          "time":964.0},
        "facet":{
          "time":0.0},
        "mlt":{
          "time":0.0},
        "highlight":{
          "time":0.0},
        "stats":{
          "time":0.0},
        "expand":{
          "time":0.0},
        "debug":{
          "time":4.0}}}}}

2)Docs being matched:
Number of documents - 2867
Number of documents in analytics that matches query - 1924


was (Author: d.kadirbekov):
1)Debug output:
"debug":{
    "rawquerystring":"host:kp.kz",
    "querystring":"host:kp.kz",
    "parsedquery":"host:kp.kz",
    "parsedquery_toString":"host:kp.kz",
    "explain":{
      "http://kp.kz/video/6628-neizvestnyj-ustroil-strelbu-iz-avtomata-v-almaty":"\n1.2848014 = (MATCH) weight(host:kp.kz in 0) [DefaultSimilarity], result of:\n  1.2848014 = fieldWeight in 0, product of:\n    1.0 = tf(freq=1.0), with freq of:\n      1.0 = termFreq=1.0\n    1.2848014 = idf(docFreq=2867, maxDocs=3813)\n    1.0 = fieldNorm(doc=0)\n",
      "http://kp.kz/video/6850-stolknovenie-poezda-s-avtobusom-v-petropavlovske-snyala-kamera-videonablyudeniya":"\n1.2848014 = (MATCH) weight(host:kp.kz in 9) [DefaultSimilarity], result of:\n  1.2848014 = fieldWeight in 9, product of:\n    1.0 = tf(freq=1.0), with freq of:\n      1.0 = termFreq=1.0\n    1.2848014 = idf(docFreq=2867, maxDocs=3813)\n    1.0 = fieldNorm(doc=9)\n"
},
    "QParser":"LuceneQParser",
    "filter_queries":["{!sentimentAnalytics cost=101}",
      "{!resourceAnalytics cost=112}",
      "{!finalAnalytics cost=121}"],
    "parsed_filter_queries":["SentimentAnalyticsQuery({!cache=false cost=101})",
      "ResourceAnalyticsQuery({!cache=false cost=112})",
      "FinalAnalyticsQuery({!cache=false cost=121})"],
    "timing":{
      "time":968.0,
      "prepare":{
        "time":0.0,
        "query":{
          "time":0.0},
        "facet":{
          "time":0.0},
        "mlt":{
          "time":0.0},
        "highlight":{
          "time":0.0},
        "stats":{
          "time":0.0},
        "expand":{
          "time":0.0},
        "debug":{
          "time":0.0}},
      "process":{
        "time":968.0,
        "query":{
          "time":964.0},
        "facet":{
          "time":0.0},
        "mlt":{
          "time":0.0},
        "highlight":{
          "time":0.0},
        "stats":{
          "time":0.0},
        "expand":{
          "time":0.0},
        "debug":{
          "time":4.0}}}}}

2)Docs being matched:
Number of documents - 2867
Number of documents in analytics that matches query - 1924

> Solr AnalyticsQuery API returns analytics by documents that don't match query
> -----------------------------------------------------------------------------
>
>                 Key: SOLR-6802
>                 URL: https://issues.apache.org/jira/browse/SOLR-6802
>             Project: Solr
>          Issue Type: Bug
>          Components: query parsers
>    Affects Versions: 4.10.2
>            Reporter: Daulet Kadirbekov
>            Priority: Critical
>
> Solr AnalyticsQuery API returns analytics by documents that don't match query.
> Example:
> Solr query: domain:example.com
> I looked for all documents by /select search handler and all of this documents was belonged to example.com domain.
> But when I looked in analytics for documents, I have found that there are many documents which don't match query. But number of documents is the same.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org