Posted to issues@lucene.apache.org by "Yonik Seeley (Jira)" <ji...@apache.org> on 2020/02/19 17:56:00 UTC

[jira] [Commented] (SOLR-13813) Shared storage online split support

    [ https://issues.apache.org/jira/browse/SOLR-13813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17040279#comment-17040279 ] 

Yonik Seeley commented on SOLR-13813:
-------------------------------------

Attaching a work-in-progress test for live shard splitting in which the leader node fails during the split.
It currently fails for both NRT replicas and shared storage replicas.  I'm still investigating and haven't opened a new issue yet because the failure could be a problem in the test itself.

When it fails for NRT, the failure is often just a single missing document.
{code}
24745 ERROR (TEST-SharedStorageSplitTest.testLiveSplitFail-seed#[2D0A18F897BF536C]) [     ] o.a.s.s.b.SharedStorageSplitTest MISSING DOCUMENTS: {doc_5137=1}
24748 INFO  (TEST-SharedStorageSplitTest.testLiveSplitFail-seed#[2D0A18F897BF536C]) [     ] o.a.s.SolrTestCaseJ4 ###Ending testLiveSplitFail
NOTE: reproduce with: ant test  -Dtestcase=SharedStorageSplitTest -Dtests.method=testLiveSplitFail -Dtests.seed=2D0A18F897BF536C -Dtests.slow=true -Dtests.badapples=true -Dtests.locale=en-DM -Dtests.timezone=America/Barbados -Dtests.asserts=true -Dtests.file.encoding=UTF-8

java.lang.AssertionError: Documents are missing! 
Expected :5106
Actual   :5107
{code}
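
For readers unfamiliar with this kind of check, here is a minimal sketch of how a "missing documents" map like the one in the log above could be computed. It is not the code from the attached test: the class name MissingDocsSketch and the method missing are hypothetical, and the id-to-count map shape is only inferred from the {doc_5137=1} log output.

```java
import java.util.Arrays;
import java.util.Collection;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper, not taken from SharedStorageSplitTest.
class MissingDocsSketch {

    // expected: document id -> number of copies the indexing threads had acknowledged.
    // found:    document ids returned by querying the collection after the split.
    // Returns the ids (with counts) that were acknowledged but not found.
    static Map<String, Integer> missing(Map<String, Integer> expected,
                                        Collection<String> found) {
        // Copy so the caller's map is untouched; TreeMap keeps the output ordered by id.
        Map<String, Integer> remaining = new TreeMap<>(expected);
        for (String id : found) {
            // Decrement the outstanding count; returning null removes the entry.
            remaining.computeIfPresent(id, (k, v) -> v > 1 ? v - 1 : null);
        }
        return remaining;
    }

    public static void main(String[] args) {
        Map<String, Integer> expected = new TreeMap<>();
        expected.put("doc_1", 1);
        expected.put("doc_2", 1);
        expected.put("doc_3", 1);
        // doc_3 was acknowledged but is absent from the query results.
        System.out.println(missing(expected, Arrays.asList("doc_1", "doc_2")));
    }
}
```

An empty result means every acknowledged document was found; any remaining entry is a document lost somewhere between the acknowledgement and the final query.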

> Shared storage online split support
> -----------------------------------
>
>                 Key: SOLR-13813
>                 URL: https://issues.apache.org/jira/browse/SOLR-13813
>             Project: Solr
>          Issue Type: Sub-task
>            Reporter: Yonik Seeley
>            Priority: Major
>         Attachments: SOLR-13813.patch
>
>          Time Spent: 1h
>  Remaining Estimate: 0h
>
> The strategy for online shard splitting is the same as that for normal (non-SHARED) shards.
> During a split, the leader forwards updates to the sub-shard leaders. Those updates are buffered by the transaction log while the split is in progress, and the buffered updates are then replayed.
> One change was added: the local index is pushed to the blob store after the buffered updates are applied (but before the sub-shard is marked as ACTIVE):
> See https://github.com/apache/lucene-solr/commit/fe17c813f5fe6773c0527f639b9e5c598b98c7d4#diff-081b7c2242d674bb175b41b6afc21663
> This issue is about adding tests and ensuring that online shard splitting (while updates are flowing) works reliably.
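
The ordering described in the issue (buffer forwarded updates, replay them, push to the blob store, then go ACTIVE) can be illustrated with a small single-threaded model. This is only a sketch under stated assumptions, not Solr's implementation: SubShardSketch and all of its members are hypothetical names, plain lists stand in for the transaction log and the index, and concurrency and failure handling (which are exactly what the test above exercises) are left out.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical model of a sub-shard leader during a split; not Solr's UpdateLog API.
class SubShardSketch {
    enum State { BUFFERING, ACTIVE }

    private State state = State.BUFFERING;
    private final List<String> bufferedLog = new ArrayList<>(); // stands in for the transaction log
    private final List<String> index = new ArrayList<>();       // stands in for the local index
    private boolean pushedToBlobStore = false;

    // Called for each update the parent shard leader forwards.
    void receive(String docId) {
        if (state == State.BUFFERING) {
            bufferedLog.add(docId); // split still in progress: buffer only
        } else {
            index.add(docId);       // split finished: apply directly
        }
    }

    // Called once the split itself has completed.
    void finishSplit() {
        index.addAll(bufferedLog);  // 1. replay the buffered updates
        bufferedLog.clear();
        pushedToBlobStore = true;   // 2. push the local index to the blob store (modelled as a flag)
        state = State.ACTIVE;       // 3. only now mark the sub-shard ACTIVE
    }

    List<String> indexedDocs() { return new ArrayList<>(index); }
    boolean isPushedToBlobStore() { return pushedToBlobStore; }
    State state() { return state; }

    public static void main(String[] args) {
        SubShardSketch subShard = new SubShardSketch();
        subShard.receive("doc_1"); // arrives while the split is running
        subShard.receive("doc_2");
        subShard.finishSplit();
        subShard.receive("doc_3"); // arrives after the sub-shard is ACTIVE
        System.out.println(subShard.indexedDocs().equals(Arrays.asList("doc_1", "doc_2", "doc_3")));
    }
}
```

In this model no update can be lost, because everything is sequential. A lost document in the real system would have to come from an interleaving this sketch does not represent, such as an update forwarded while the leader is failing.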


