You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Vannia Rajan (JIRA)" <ji...@apache.org> on 2017/08/24 14:01:17 UTC
[jira] [Created] (SOLR-11287) Sub-shards by SPLITSHARD loses data
on restarting SOLR
Vannia Rajan created SOLR-11287:
-----------------------------------
Summary: Sub-shards by SPLITSHARD loses data on restarting SOLR
Key: SOLR-11287
URL: https://issues.apache.org/jira/browse/SOLR-11287
Project: Solr
Issue Type: Bug
Security Level: Public (Default Security Level. Issues are Public)
Components: SolrCloud
Affects Versions: 5.5.1
Environment: Ubuntu 64-bit 32-core server, 240GB RAM
Reporter: Vannia Rajan
We are running SOLR 5.5.1 with 4 nodes (1 shard per node). We are in the process of splitting the 4 shards into 8 shards.
The SPLITSHARD collections API works great - it does create the sub-shards and activates sub-shards, inactivates the parent shard upon completion. The row count compard with parent shard vs sub-shards are matching. However, the data in sub-shards doesn't seem to be persistent in our case.
A restart of SOLR leaves the sub-shards with 0 documents with their data directory sizes getting reduced from 40+ GB to 71KB.
If I'm missing any other steps to be followed after SPLITSHARD to make the data in sub-shards persistent, please let me know. Otherwise, I feel this may be a bug in v5.5.1.
Note: I was able to manually set the states of parent to "active" and children with 0 documents as "inactive" by setting /collections/{collection}/state.json in zookeeper, to get back the lost data.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org