You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Steve Rowe (JIRA)" <ji...@apache.org> on 2018/10/09 20:20:00 UTC

[jira] [Comment Edited] (SOLR-12739) Make autoscaling policy based replica placement the default strategy for placing replicas

    [ https://issues.apache.org/jira/browse/SOLR-12739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16644039#comment-16644039 ] 

Steve Rowe edited comment on SOLR-12739 at 10/9/18 8:19 PM:
------------------------------------------------------------

{{AutoAddReplicasIntegrationTest.testSimple()}} is also failing without a seed, and the first failing commit is also {{ee09bef7}} on this issue.  E.g. (also from [https://jenkins.thetaphi.de/job/Lucene-Solr-7.x-MacOSX/875/]):

{noformat}
   [junit4]   2> NOTE: reproduce with: ant test  -Dtestcase=AutoAddReplicasIntegrationTest -Dtests.method=testSimple -Dtests.seed=EE09D63ABEE1B3CF -Dtests.slow=true -Dtests.locale=it-CH -Dtests.timezone=Brazil/Acre -Dtests.asserts=true -Dtests.file.encoding=UTF-8
   [junit4] FAILURE  129s J0 | AutoAddReplicasIntegrationTest.testSimple <<<
   [junit4]    > Throwable #1: java.lang.AssertionError: Waiting for collection testSimple1
   [junit4]    > null
   [junit4]    > Live Nodes: [127.0.0.1:61242_solr, 127.0.0.1:61245_solr]
   [junit4]    > Last available state: DocCollection(testSimple1//collections/testSimple1/state.json/20)={
   [junit4]    >   "pullReplicas":"0",
   [junit4]    >   "replicationFactor":"2",
   [junit4]    >   "shards":{
   [junit4]    >     "shard1":{
   [junit4]    >       "range":"80000000-ffffffff",
   [junit4]    >       "state":"active",
   [junit4]    >       "replicas":{
   [junit4]    >         "core_node3":{
   [junit4]    >           "core":"testSimple1_shard1_replica_n1",
   [junit4]    >           "base_url":"http://127.0.0.1:61242/solr",
   [junit4]    >           "node_name":"127.0.0.1:61242_solr",
   [junit4]    >           "state":"active",
   [junit4]    >           "type":"NRT",
   [junit4]    >           "force_set_state":"false",
   [junit4]    >           "leader":"true"},
   [junit4]    >         "core_node12":{
   [junit4]    >           "core":"testSimple1_shard1_replica_n11",
   [junit4]    >           "base_url":"http://127.0.0.1:61242/solr",
   [junit4]    >           "node_name":"127.0.0.1:61242_solr",
   [junit4]    >           "state":"active",
   [junit4]    >           "type":"NRT",
   [junit4]    >           "force_set_state":"false"}}},
   [junit4]    >     "shard2":{
   [junit4]    >       "range":"0-7fffffff",
   [junit4]    >       "state":"active",
   [junit4]    >       "replicas":{
   [junit4]    >         "core_node8":{
   [junit4]    >           "core":"testSimple1_shard2_replica_n6",
   [junit4]    >           "base_url":"http://127.0.0.1:61242/solr",
   [junit4]    >           "node_name":"127.0.0.1:61242_solr",
   [junit4]    >           "state":"active",
   [junit4]    >           "type":"NRT",
   [junit4]    >           "force_set_state":"false",
   [junit4]    >           "leader":"true"},
   [junit4]    >         "core_node10":{
   [junit4]    >           "core":"testSimple1_shard2_replica_n9",
   [junit4]    >           "base_url":"http://127.0.0.1:61244/solr",
   [junit4]    >           "node_name":"127.0.0.1:61244_solr",
   [junit4]    >           "state":"down",
   [junit4]    >           "type":"NRT",
   [junit4]    >           "force_set_state":"false"}}}},
   [junit4]    >   "router":{"name":"compositeId"},
   [junit4]    >   "maxShardsPerNode":"2",
   [junit4]    >   "autoAddReplicas":"true",
   [junit4]    >   "nrtReplicas":"2",
   [junit4]    >   "tlogReplicas":"0"}
   [junit4]    > 	at __randomizedtesting.SeedInfo.seed([EE09D63ABEE1B3CF:D6BAF2C49912671E]:0)
   [junit4]    > 	at org.apache.solr.cloud.SolrCloudTestCase.waitForState(SolrCloudTestCase.java:280)
   [junit4]    > 	at org.apache.solr.cloud.autoscaling.AutoAddReplicasIntegrationTest.testSimple(AutoAddReplicasIntegrationTest.java:107)
   [junit4]    > 	at java.lang.Thread.run(Thread.java:748)
{noformat}


was (Author: steve_rowe):
{{AutoAddReplicasIntegrationTest.testSimple()}} is also failing without a seed, and the first failing commit is also {{ee09bef7}} on this issue.  E.g. (also from [https://jenkins.thetaphi.de/job/Lucene-Solr-7.x-MacOSX/875/]):

{noformat}
   [junit4]   2> NOTE: reproEE1B3CF -Dtests.slow=true -Dtests.locale=it-CH -Dtests.timezone=Brazil/Acre -Dtests.asserts=true -Dtests.file.encoding=UTF-8
   [junit4] FAILURE  129s J0 | AutoAddReplicasIntegrationTest.testSimple <<<
   [junit4]    > Throwable #1: java.lang.AssertionError: Waiting for collection testSimple1
   [junit4]    > null
   [junit4]    > Live Nodes: [127.0.0.1:61242_solr, 127.0.0.1:61245_solr]
   [junit4]    > Last available state: DocCollection(testSimple1//collections/testSimple1/state.json/20)={
   [junit4]    >   "pullReplicas":"0",
   [junit4]    >   "replicationFactor":"2",
   [junit4]    >   "shards":{
   [junit4]    >     "shard1":{
   [junit4]    >       "range":"80000000-ffffffff",
   [junit4]    >       "state":"active",
   [junit4]    >       "replicas":{
   [junit4]    >         "core_node3":{
   [junit4]    >           "core":"testSimple1_shard1_replica_n1",
   [junit4]    >           "base_url":"http://127.0.0.1:61242/solr",
   [junit4]    >           "node_name":"127.0.0.1:61242_solr",
   [junit4]    >           "state":"active",
   [junit4]    >           "type":"NRT",
   [junit4]    >           "force_set_state":"false",
   [junit4]    >           "leader":"true"},
   [junit4]    >         "core_node12":{
   [junit4]    >           "core":"testSimple1_shard1_replica_n11",
   [junit4]    >           "base_url":"http://127.0.0.1:61242/solr",
   [junit4]    >           "node_name":"127.0.0.1:61242_solr",
   [junit4]    >           "state":"active",
   [junit4]    >           "type":"NRT",
   [junit4]    >           "force_set_state":"false"}}},
   [junit4]    >     "shard2":{
   [junit4]    >       "range":"0-7fffffff",
   [junit4]    >       "state":"active",
   [junit4]    >       "replicas":{
   [junit4]    >         "core_node8":{
   [junit4]    >           "core":"testSimple1_shard2_replica_n6",
   [junit4]    >           "base_url":"http://127.0.0.1:61242/solr",
   [junit4]    >           "node_name":"127.0.0.1:61242_solr",
   [junit4]    >           "state":"active",
   [junit4]    >           "type":"NRT",
   [junit4]    >           "force_set_state":"false",
   [junit4]    >           "leader":"true"},
   [junit4]    >         "core_node10":{
   [junit4]    >           "core":"testSimple1_shard2_replica_n9",
   [junit4]    >           "base_url":"http://127.0.0.1:61244/solr",
   [junit4]    >           "node_name":"127.0.0.1:61244_solr",
   [junit4]    >           "state":"down",
   [junit4]    >           "type":"NRT",
   [junit4]    >           "force_set_state":"false"}}}},
   [junit4]    >   "router":{"name":"compositeId"},
   [junit4]    >   "maxShardsPerNode":"2",
   [junit4]    >   "autoAddReplicas":"true",
   [junit4]    >   "nrtReplicas":"2",
   [junit4]    >   "tlogReplicas":"0"}
   [junit4]    > 	at __randomizedtesting.SeedInfo.seed([EE09D63ABEE1B3CF:D6BAF2C49912671E]:0)
   [junit4]    > 	at org.apache.solr.cloud.SolrCloudTestCase.waitForState(SolrCloudTestCase.java:280)
   [junit4]    > 	at org.apache.solr.cloud.autoscaling.AutoAddReplicasIntegrationTest.testSimple(AutoAddReplicasIntegrationTest.java:107)
   [junit4]    > 	at java.lang.Thread.run(Thread.java:748)
{noformat}

> Make autoscaling policy based replica placement the default strategy for placing replicas
> -----------------------------------------------------------------------------------------
>
>                 Key: SOLR-12739
>                 URL: https://issues.apache.org/jira/browse/SOLR-12739
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: AutoScaling, SolrCloud
>            Reporter: Shalin Shekhar Mangar
>            Assignee: Shalin Shekhar Mangar
>            Priority: Major
>             Fix For: 7.6, master (8.0)
>
>         Attachments: SOLR-12739.patch, SOLR-12739.patch, SOLR-12739.patch, SOLR-12739.patch, SOLR-12739.patch
>
>
> Today the default placement strategy is the same one used since Solr 4.x which is to select nodes on a round robin fashion. I propose to make the autoscaling policy based replica placement as the default policy for placing replicas.
> This is related to SOLR-12648 where even though we have default cluster preferences, we don't use them unless a policy is also configured.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org