You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Erick Erickson (JIRA)" <ji...@apache.org> on 2017/12/25 22:16:02 UTC

[jira] [Resolved] (SOLR-11427) DELETEREPLICA with onlyIfDown specified should succeed if the host node is not present in the live_nodes Znode

     [ https://issues.apache.org/jira/browse/SOLR-11427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Erick Erickson resolved SOLR-11427.
-----------------------------------
    Resolution: Won't Fix

Thinking about this after letting it sit for a while, this is a really bad idea.

Nodes delete just fine if you don't specify onlyIfDown=true. The idea of saying "DELETEREPLICA will respect the onlyIfDown flag even if the node is marked as active if the host happens be absent" is:

1> confusing. Hey, I said onlyIfDown!

2> dangerous. Small programming or scripting errors and onlyIfDown isn't the safety valve it was intended to be

3> unnecessary. In the scenario outlined where the node died horribly and even if it's never coming back, leave off the onlyIfDown bit and the node will go away.

In short, I see no utility here, and opportunities to have unpleasant things happen so not worth the work.


> DELETEREPLICA with onlyIfDown specified should succeed if the host node is not present in the live_nodes Znode
> --------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-11427
>                 URL: https://issues.apache.org/jira/browse/SOLR-11427
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Erick Erickson
>            Assignee: Erick Erickson
>
> The title says it pretty much, so opening up for discussion:
> Here's the problem. Let's say a node is killed via {{kill -9}}. The state.json file still says it's "active", but the node is gone from live_nodes. If the node in question never comes back, the replica's state doesn't necessarily get switched to "down", so specifying onlyIfDown fails with "node is active" message. This is all documented more thoroughly in SOLR-9361.
> The question is whether it's sufficient and/or safe to succeed in deleting the replica from state.json if the state is "active" _and_ the node is NOT present in live_nodes.
> I'm assigning to myself, but others should feel free to take it.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org