You are viewing a plain text version of this content. The canonical link for it is here.

Posted to solr-user@lucene.apache.org by kafka0102 <ka...@163.com> on 2010/10/20 13:07:11 UTC

why solr search is slower than lucene so much?

HI.
my solr seach has some performance problem recently.
my query is like that: q=xx&fq=fid:1&fq=atm:[int_time1 TO int_time2],
fid's type is : <fieldType name="int" class="solr.TrieIntField" 
precisionStep="0" omitNorms="true" positionIncrementGap="0"/>
atm's type is : <fieldType name="sint" class="solr.TrieIntField" 
precisionStep="8" omitNorms="true" positionIncrementGap="0"/>
my index's size is about 500M and record num is 3984274.
when I use solr's SolrIndexSearcher.search(QueryResult qr, QueryCommand 
cmd), it cost about70ms. When I changed use lucence'API, just like bottom:

       final SolrQueryRequest req = rb.req;
       final SolrIndexSearcher searcher = req.getSearcher();
       final SolrIndexSearcher.QueryCommand cmd = rb.getQueryCommand();
       final ExecuteTimeStatics timeStatics = 
ExecuteTimeStatics.getExecuteTimeStatics();
       final ExecuteTimeUnit staticUnit = 
timeStatics.addExecuteTimeUnit("test2");
       staticUnit.start();
       final List<Query> query = cmd.getFilterList();
       final BooleanQuery booleanFilter = new BooleanQuery();
       for (final Query q : query) {
         booleanFilter.add(new BooleanClause(q,Occur.MUST));
       }
       booleanFilter.add(new BooleanClause(cmd.getQuery(),Occur.MUST));
       logger.info("q:"+query);
       final Sort sort = cmd.getSort();
       final TopFieldDocs docs = 
searcher.search(booleanFilter,null,20,sort);
       final StringBuilder sbBuilder = new StringBuilder();
       for (final ScoreDoc doc :docs.scoreDocs) {
         sbBuilder.append(doc.doc+",");
       }
       logger.info("hits:"+docs.totalHits+",result:"+sbBuilder.toString());
       staticUnit.end();

it cost only about 20ms.
I'm so confused. For solr's config, I closed cache. For test, I first 
called lucene's, and then solr's.
Maybe I should look solr's source more carefully. But now, can anyone 
knows the reason?

Re: why solr search is slower than lucene so much?

Posted by Yonik Seeley <yo...@lucidimagination.com>.

Careful comparing apples to oranges ;-)
For one, your lucene code doesn't retrieve stored fields.
Did you try the solr request more than once (with a different q, but
the same filters?)

Also, by default, Solr independently caches the filters.  This can be
higher up-front cost, but a win when filters are reused.  If you want
something closer to your lucene code, you could add all the filters to
 the main query and not use "fq".

-Yonik
http://www.lucidimagination.com



On Wed, Oct 20, 2010 at 7:07 AM, kafka0102 <ka...@163.com> wrote:
> HI.
> my solr seach has some performance problem recently.
> my query is like that: q=xx&fq=fid:1&fq=atm:[int_time1 TO int_time2],
> fid's type is : <fieldType name="int" class="solr.TrieIntField"
> precisionStep="0" omitNorms="true" positionIncrementGap="0"/>
> atm's type is : <fieldType name="sint" class="solr.TrieIntField"
> precisionStep="8" omitNorms="true" positionIncrementGap="0"/>
> my index's size is about 500M and record num is 3984274.
> when I use solr's SolrIndexSearcher.search(QueryResult qr, QueryCommand
> cmd), it cost about70ms. When I changed use lucence'API, just like bottom:
>
>      final SolrQueryRequest req = rb.req;
>      final SolrIndexSearcher searcher = req.getSearcher();
>      final SolrIndexSearcher.QueryCommand cmd = rb.getQueryCommand();
>      final ExecuteTimeStatics timeStatics =
> ExecuteTimeStatics.getExecuteTimeStatics();
>      final ExecuteTimeUnit staticUnit =
> timeStatics.addExecuteTimeUnit("test2");
>      staticUnit.start();
>      final List<Query> query = cmd.getFilterList();
>      final BooleanQuery booleanFilter = new BooleanQuery();
>      for (final Query q : query) {
>        booleanFilter.add(new BooleanClause(q,Occur.MUST));
>      }
>      booleanFilter.add(new BooleanClause(cmd.getQuery(),Occur.MUST));
>      logger.info("q:"+query);
>      final Sort sort = cmd.getSort();
>      final TopFieldDocs docs = searcher.search(booleanFilter,null,20,sort);
>      final StringBuilder sbBuilder = new StringBuilder();
>      for (final ScoreDoc doc :docs.scoreDocs) {
>        sbBuilder.append(doc.doc+",");
>      }
>      logger.info("hits:"+docs.totalHits+",result:"+sbBuilder.toString());
>      staticUnit.end();
>
> it cost only about 20ms.
> I'm so confused. For solr's config, I closed cache. For test, I first called
> lucene's, and then solr's.
> Maybe I should look solr's source more carefully. But now, can anyone knows
> the reason?
>
>
>