You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Shawn Heisey (JIRA)" <ji...@apache.org> on 2014/05/16 12:29:23 UTC

[jira] [Closed] (SOLR-6053) 搜索文档总数不一致

     [ https://issues.apache.org/jira/browse/SOLR-6053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Shawn Heisey closed SOLR-6053.
------------------------------

       Resolution: Not a Problem
    Fix Version/s:     (was: 4.7)

Trying to read between the lines (because as Ahmet has noted, you haven't really given us anything to go on)...

If you have a sharded index and the numFound changes when you run the same query more than once, then it is likely that you have documents with the same uniqueKey field value in more than one shard.  Solr assumes that every document across all shards has a unique value in the uniqueKey field.  If this is not the case, then Solr cannot guarantee correct results.

Solr is smart enough to eliminate duplicates from any results that are returned, but in order for that to happen across the whole index, every document must be considered -- which is why it works properly when rows is larger than numFound.

This should have been brought up on the solr-user mailing list, not as an issue in Jira.

http://lucene.apache.org/solr/discussion.html

Closing as Not a Problem.  If further investigation via regular support avenues (like the mailing list or the IRC channel) reveals that there is a bug, we can reopen.


> 搜索文档总数不一致
> ---------
>
>                 Key: SOLR-6053
>                 URL: https://issues.apache.org/jira/browse/SOLR-6053
>             Project: Solr
>          Issue Type: Bug
>          Components: clients - java
>    Affects Versions: 4.7
>         Environment: Centos
>            Reporter: wanggang
>            Priority: Critical
>   Original Estimate: 0.05h
>  Remaining Estimate: 0.05h
>
> http://192.168.3.21:8901/sentiment/search?q=%E6%B2%A5%E9%9D%92%E7%BD%90%E8%B5%B7%E7%81%AB&hlfl=title,content&hlsimple=red&start=0&rows=10
> start 切换不同的数值就能看到效果了 
> As my in deep test find out, if the rows=0, the results size is consistently the total sum of the documents on all shards regardless there is any duplicates; if the rows is a number larger than the supposedly returned the merge document number, the result numFound is accurate and consistent, however, if the rows is with a number smaller than the supposedly merge results size, it will be non-deterministic. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org