You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Moulay Hicham <ma...@gmail.com> on 2020/12/10 16:28:24 UTC
Solr Collection reload task has been in running state for a very long time
Hi,
We have a solr cluster of 30 nodes with a Replication Factor =3.
Each index size is about 80GB.
Solr version is 8.1
The cluster has high TPS both in read and write.
We have recently made a schema change and uploaded it using ZKCLI
script. Then we issue a collection reload async request:
admin/collections?action=RELOAD&name=<collectionname>&async=1000'
When we check on the status of this request, it shows that it's still running:
admin/collections?action=REQUESTSTATUS&requestid=1000'
{
"responseHeader":{
"status":0,
"QTime":1},
"status":{
"state":"running",
"msg":"found [1000] in running tasks"}}
This task has been in a running state for about 5 hours so far. I am
not sure if this is expected or the status of this task failed or
completed but never reported back to zookeeper.
Also if running for that long - is it because the index is being
actively (with high TPS) updated? We have a softcommit of 10s and
hadcommit of 60s.
Please help me understand what's going on.
Thanks,
Moulay
Re: Solr Collection reload task has been in running state for a very
long time
Posted by Moulay Hicham <ma...@gmail.com>.
Hi,
I will really appreciate if someone can help me with this.
Thank you,
Moulay
On Thu, Dec 10, 2020, 8:28 AM Moulay Hicham <ma...@gmail.com> wrote:
> Hi,
>
> We have a solr cluster of 30 nodes with a Replication Factor =3.
> Each index size is about 80GB.
> Solr version is 8.1
> The cluster has high TPS both in read and write.
>
> We have recently made a schema change and uploaded it using ZKCLI
> script. Then we issue a collection reload async request:
> admin/collections?action=RELOAD&name=<collectionname>&async=1000'
>
> When we check on the status of this request, it shows that it's still
> running:
>
> admin/collections?action=REQUESTSTATUS&requestid=1000'
> {
> "responseHeader":{
> "status":0,
> "QTime":1},
> "status":{
> "state":"running",
> "msg":"found [1000] in running tasks"}}
>
> This task has been in a running state for about 5 hours so far. I am
> not sure if this is expected or the status of this task failed or
> completed but never reported back to zookeeper.
>
> Also if running for that long - is it because the index is being
> actively (with high TPS) updated? We have a softcommit of 10s and
> hadcommit of 60s.
>
> Please help me understand what's going on.
>
> Thanks,
> Moulay
>