You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@solr.apache.org by "Mazen Raafat (Jira)" <ji...@apache.org> on 2021/04/19 01:32:00 UTC

[jira] [Updated] (SOLR-15354) Querying multiple collection performance issue

     [ https://issues.apache.org/jira/browse/SOLR-15354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mazen Raafat updated SOLR-15354:
--------------------------------
    Attachment: querying collection 2.png
                querying collection 1.png
                thread group.png
                querying alias points to collection 1 and collection 2.png

> Querying multiple collection performance issue
> ----------------------------------------------
>
>                 Key: SOLR-15354
>                 URL: https://issues.apache.org/jira/browse/SOLR-15354
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: search
>    Affects Versions: 8.8
>         Environment: SolrCloud Setup: 3 zookeeper servers and 2 solr cloud nodes
> each Solr node hosted in AWS m5.xlarge EC2 with 8G RAM dedicated to Solr JVM heap 
>  in this environment each collection is one shard and 2 replicas
> for benchmarking i've used JMeter, setting the thread group=50, and loop count=500
>  
>            Reporter: Mazen Raafat
>            Priority: Critical
>              Labels: alias, multiple, performance, performance-benchmark, search
>         Attachments: querying alias points to collection 1 and collection 2.png, querying collection 1.png, querying collection 2.png, thread group.png
>
>
> performance degradation when querying multiple collections using aliases that points to multiple collection or calling the search handler directly with  collection query param as follow
> {{[http://localhost:8983/solr/collection1/select?collection=collection1,collection2,collection3]}}
>  
> in the first test i have queried a collection with about 40k docs, the throughput was ~3k req/sec
> in the second test  i have queried another collection with about 4k docs, the throughput was ~3.5k req/sec
> in the third test i have queried an alias that points to both collections and viola! the throughput dropped to be ~200 req/sec!
> i have tried not to use alias and use solrurl/solr/collection1/select?collections=collection2 and got the same result
>  
> notes:
>  # both collections have the same schema
>  # query and filter query are the same in all tests 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@solr.apache.org
For additional commands, e-mail: issues-help@solr.apache.org