Posted to dev@lucene.apache.org by "Vannia Rajan (JIRA)" <ji...@apache.org> on 2017/08/24 14:01:17 UTC

[jira] [Created] (SOLR-11287) Sub-shards by SPLITSHARD loses data on restarting SOLR

Vannia Rajan created SOLR-11287:
-----------------------------------

             Summary: Sub-shards by SPLITSHARD loses data on restarting SOLR
                 Key: SOLR-11287
                 URL: https://issues.apache.org/jira/browse/SOLR-11287
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
          Components: SolrCloud
    Affects Versions: 5.5.1
         Environment: Ubuntu 64-bit 32-core server, 240GB RAM
            Reporter: Vannia Rajan


We are running SOLR 5.5.1 with 4 nodes (1 shard per node). We are in the process of splitting the 4 shards into 8 shards.

The SPLITSHARD Collections API call works as expected: it creates the sub-shards, activates them, and marks the parent shard inactive upon completion. The row counts of the parent shard and its sub-shards match. However, the data in the sub-shards does not seem to be persistent in our case.
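
For reference, the SPLITSHARD requests described above can be sketched as follows. This only builds the request URLs and does not send them. The base URL, the collection name "mycollection" and the shard names shard1 to shard4 are assumptions for illustration, not values from our cluster.

```python
from urllib.parse import urlencode


def splitshard_url(base_url, collection, shard, async_id=None):
    """Build a Collections API SPLITSHARD request URL (nothing is sent)."""
    params = {"action": "SPLITSHARD", "collection": collection, "shard": shard}
    if async_id is not None:
        # Optional request id for running the split asynchronously.
        params["async"] = async_id
    return f"{base_url.rstrip('/')}/admin/collections?{urlencode(params)}"


# Hypothetical host, collection and shard names: one split per parent shard.
urls = [
    splitshard_url("http://localhost:8983/solr", "mycollection", f"shard{i}")
    for i in range(1, 5)
]
for url in urls:
    print(url)
```

Each of the 4 parent shards needs its own request, which gives the 8 sub-shards in total.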

After a restart of SOLR, the sub-shards contain 0 documents and their data directories shrink from 40+ GB to 71 KB.
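
One way to make the loss visible is to record the document count of each sub-shard core before and after the restart and compare the two. The sketch below assumes each core is queried directly with distrib=false so that only its local index is counted. The core URL, the shard names and the counts in the example are made up for illustration.

```python
import json
from urllib.parse import urlencode


def count_query_url(core_url):
    """URL that asks a single core for its local document count only."""
    params = {"q": "*:*", "rows": 0, "distrib": "false", "wt": "json"}
    return f"{core_url.rstrip('/')}/select?{urlencode(params)}"


def num_found(response_text):
    """Extract numFound from a JSON select response body."""
    return json.loads(response_text)["response"]["numFound"]


def lost_documents(before, after):
    """Return {shard: documents lost} for shards whose count dropped."""
    return {
        shard: count - after.get(shard, 0)
        for shard, count in before.items()
        if after.get(shard, 0) < count
    }


# Made-up counts: shard1_0 was emptied by the restart, shard1_1 was not.
before = {"shard1_0": 100, "shard1_1": 120}
after = {"shard1_0": 0, "shard1_1": 120}
print(lost_documents(before, after))
```

The counts passed to lost_documents would come from calling num_found on the response of each count_query_url request, once before the restart and once after.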

If I'm missing any other steps to be followed after SPLITSHARD to make the data in sub-shards persistent, please let me know. Otherwise, I feel this may be a bug in v5.5.1.

Note: I was able to get the lost data back by manually setting the state of the parent shards to "active" and the state of the children (with 0 documents) to "inactive" in /collections/{collection}/state.json in ZooKeeper.
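
The state change in the note above can be sketched as a small edit of the state.json content. This assumes the usual layout in which the top-level key is the collection name and each entry under "shards" carries a "state" field. The sample document is trimmed and hypothetical, and reading from or writing back to ZooKeeper is not shown.

```python
import json


def set_shard_states(state_json_text, collection, new_states):
    """Return state.json text with the given shard states applied."""
    state = json.loads(state_json_text)
    shards = state[collection]["shards"]
    for shard_name, shard_state in new_states.items():
        if shard_name not in shards:
            raise KeyError(f"unknown shard: {shard_name}")
        shards[shard_name]["state"] = shard_state
    return json.dumps(state, indent=2)


# Trimmed, hypothetical state.json content for one split parent shard.
sample = json.dumps({
    "mycollection": {
        "shards": {
            "shard1": {"state": "inactive"},
            "shard1_0": {"state": "active"},
            "shard1_1": {"state": "active"},
        }
    }
})

# Reactivate the parent and deactivate the empty children.
fixed = set_shard_states(
    sample,
    "mycollection",
    {"shard1": "active", "shard1_0": "inactive", "shard1_1": "inactive"},
)
print(fixed)
```

This is only a workaround to regain access to the parent shard's data. It undoes the split rather than repairing the sub-shards.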



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
