You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Amrit Sarkar (JIRA)" <ji...@apache.org> on 2017/10/07 15:06:00 UTC

[jira] [Comment Edited] (SOLR-11278) Fix race in cdcr bootstrap process

    [ https://issues.apache.org/jira/browse/SOLR-11278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16195690#comment-16195690 ] 

Amrit Sarkar edited comment on SOLR-11278 at 10/7/17 3:05 PM:
--------------------------------------------------------------

[~varunthacker],

Failure is very inconsistent. None of the failuers accessible at https://jenkins.thetaphi.de/job/Lucene-Solr-7.x-Linux/ have the test failing and I am unable to produce it for 100 and 500 beasts. Not sure, how to proceed on this now.


was (Author: sarkaramrit2@gmail.com):
[~varunthacker],

Failure is very inconsistent. None of the failuers accessible at https://jenkins.thetaphi.de/job/Lucene-Solr-7.x-Linux/ doesn't have the test failing and I am unable to produce it for 100 and 500 beasts. Not sure, how to proceed on this now.

> Fix race in cdcr bootstrap process
> ----------------------------------
>
>                 Key: SOLR-11278
>                 URL: https://issues.apache.org/jira/browse/SOLR-11278
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: CDCR
>    Affects Versions: 6.6.1, 7.0
>            Reporter: Amrit Sarkar
>            Assignee: Varun Thacker
>            Priority: Critical
>              Labels: test
>             Fix For: 7.1
>
>         Attachments: master-bs.patch, SOLR-11278-awaits-fix.patch, SOLR-11278-cancel-bootstrap-on-stop.patch, SOLR-11278.patch, SOLR-11278.patch, SOLR-11278.patch, SOLR-11278.patch, test_results
>
>
> {{CdcrBootstrapTest}} is failing while running beasts for significant iterations.
> The bootstrapping is failing in the test, after the first batch is indexed for each {{testmethod}}, which results in documents mismatch ::
> {code}
>   [beaster]   2> 39167 ERROR (updateExecutor-39-thread-1-processing-n:127.0.0.1:42155_solr x:cdcr-target_shard1_replica_n1 s:shard1 c:cdcr-target r:core_node2) [n:127.0.0.1:42155_solr c:cdcr-target s:shard1 r:core_node2 x:cdcr-target_shard1_replica_n1] o.a.s.h.CdcrRequestHandler Bootstrap operation failed
>   [beaster]   2> java.util.concurrent.ExecutionException: java.lang.AssertionError
>   [beaster]   2> 	at java.util.concurrent.FutureTask.report(FutureTask.java:122)
>   [beaster]   2> 	at java.util.concurrent.FutureTask.get(FutureTask.java:192)
>   [beaster]   2> 	at org.apache.solr.handler.CdcrRequestHandler.lambda$handleBootstrapAction$0(CdcrRequestHandler.java:654)
>   [beaster]   2> 	at com.codahale.metrics.InstrumentedExecutorService$InstrumentedRunnable.run(InstrumentedExecutorService.java:176)
>   [beaster]   2> 	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>   [beaster]   2> 	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>   [beaster]   2> 	at org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:188)
>   [beaster]   2> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>   [beaster]   2> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>   [beaster]   2> 	at java.lang.Thread.run(Thread.java:748)
>   [beaster]   2> Caused by: java.lang.AssertionError
>   [beaster]   2> 	at org.apache.solr.handler.CdcrRequestHandler$BootstrapCallable.call(CdcrRequestHandler.java:813)
>   [beaster]   2> 	at org.apache.solr.handler.CdcrRequestHandler$BootstrapCallable.call(CdcrRequestHandler.java:724)
>   [beaster]   2> 	at com.codahale.metrics.InstrumentedExecutorService$InstrumentedCallable.call(InstrumentedExecutorService.java:197)
>   [beaster]   2> 	... 5 more
> {code}
> {code}
>   [beaster] [01:37:16.282] FAILURE  153s | CdcrBootstrapTest.testBootstrapWithSourceCluster <<<
>   [beaster]    > Throwable #1: java.lang.AssertionError: Document mismatch on target after sync expected:<2000> but was:<1000>
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org