You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "David Smiley (JIRA)" <ji...@apache.org> on 2017/11/01 20:07:00 UTC

[jira] [Created] (SOLR-11595) optimize SolrIndexSearcher.localCollectionStatistics to use cached MultiFields

David Smiley created SOLR-11595:
-----------------------------------

             Summary: optimize SolrIndexSearcher.localCollectionStatistics to use cached MultiFields
                 Key: SOLR-11595
                 URL: https://issues.apache.org/jira/browse/SOLR-11595
             Project: Solr
          Issue Type: Improvement
      Security Level: Public (Default Security Level. Issues are Public)
          Components: search
            Reporter: David Smiley
            Assignee: David Smiley
            Priority: Minor
             Fix For: 7.2


{{SolrIndexSearcher.localCollectionStatistics(field)}} simply calls Lucene's {{IndexSearcher.collectionStatistics(field)}} which in turn calls {{MultiFields.getTerms(reader, field)}}.  Profiling in an app with many 150 fields in the query shows that building the MultiTerms here is expensive.  Fortunately it turns out that Solr already has a cached instance via {{SlowCompositeReaderWrapper}} (using MultiFields which has a ConcurrentHashMap to the MultiTerms keyed by field String.

Perhaps this should be improved on the Lucene side... not sure.  But here on the Solr side, the solution is straight-forward.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org