You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2018/10/30 13:06:00 UTC
[jira] [Created] (SOLR-12941) IndexSizeTrigger and splitMethod=link
problems
Andrzej Bialecki created SOLR-12941:
----------------------------------------
Summary: IndexSizeTrigger and splitMethod=link problems
Key: SOLR-12941
URL: https://issues.apache.org/jira/browse/SOLR-12941
Project: Solr
Issue Type: Bug
Security Level: Public (Default Security Level. Issues are Public)
Affects Versions: 7.6, master (8.0)
Reporter: Andrzej Bialecki
Assignee: Andrzej Bialecki
{{IndexSizeTrigger}} can be configured to use {{splitMethod=link}} (SOLR-12730), which uses hard-linking for creating sub-shards.
However, if the trigger uses {{aboveBytes}} condition the resulting sub-shards will not immediately decrease in size, until all of the deleted documents will be expunged (either by gradual merges or by explicit and costly expungeDeletes command). As a result the new sub-shards will still exceed the {{aboveBytes}} threshold, which will cause the trigger to keep generating new split requests.
I see two options how to solve this:
* disallow using {{aboveBytes}} with {{splitMethod=link}}. This unfortunately is a very desirable combination because it monitors the actual index size and uses the fast splitting method.
* calculate an internal estimate of "eventual index size" for an index with deletions, and use this estimate when checking with {{aboveBytes}} instead of the real index size. This of course introduces a potentially significant estimation error but allows to properly treat hard-linked sub-shards with deletions as (eventually) significantly smaller than the parent shard.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org