Posted to dev@lucene.apache.org by "Shawn Heisey (JIRA)" <ji...@apache.org> on 2017/12/05 18:28:00 UTC

[jira] [Commented] (SOLR-3666) DataImportHandler status command in SolrCloud does not work properly

    [ https://issues.apache.org/jira/browse/SOLR-3666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16279001#comment-16279001 ] 

Shawn Heisey commented on SOLR-3666:
------------------------------------

This problem still comes up on the mailing list, five years after the issue was created.

Fixing this problem would require that when an import is started, the handler name be written somewhere in ZooKeeper under the znode for the collection, along with the host/port/context of the core where the import was started.  That way, when a status request comes in, Solr will know where the import is actually happening and can direct the status command to the same place.
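
To make the idea concrete, here is a rough sketch of the bookkeeping, not a proposed patch.  It is written in Python for brevity, and a plain dict stands in for the data that would live under the collection's znode.  The names ImportRegistry, ImportLocation, record_import_start and status_target are all made up for illustration; none of them exist in Solr.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class ImportLocation:
    """Host/port/context of the core where an import was started."""
    host: str
    port: int
    context: str


class ImportRegistry:
    """In-memory stand-in (an assumption) for info stored under the collection's znode."""

    def __init__(self) -> None:
        # Keyed by (collection, handler name), e.g. ("books", "/dataimport").
        self._entries: Dict[Tuple[str, str], ImportLocation] = {}

    def record_import_start(self, collection: str, handler: str,
                            location: ImportLocation) -> None:
        # Called when an import starts: remember where it is running.
        self._entries[(collection, handler)] = location

    def status_target(self, collection: str, handler: str) -> Optional[ImportLocation]:
        # Called when a status request arrives: where should it be sent?
        # None means nothing is recorded for this handler.
        return self._entries.get((collection, handler))
```

A status request arriving on any node would call status_target() and forward the request to the returned location, or answer locally when nothing is recorded.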

One potential problem I thought of: What should Solr do if the host/port/context that has been written into ZK stops responding?  Suppose we remove the info we wrote to ZK when that happens, but the problem is transitory and the import didn't actually stop.  Status requests made after the transitory problem fixes itself might then not go to the core that is actually running the import, so the user may see an idle status for an import that is still running.

Another wrinkle: If there is no info in ZK for the handler name, should a "status" command write the info to ZK, or should that be reserved to commands that start imports?

Something else to think about: Should the info pointing further requests to the same core have an expiration?  Some imports can take *days*, and there might even be some that take longer. If there is an expiration after which the znode is removed, it would be absolutely mandatory for the time interval to be configurable, in several time units.
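
As an illustration of what "configurable, in several time units" could look like, here is a small Python sketch.  The interval syntax (a whole number followed by s, m, h or d) and the function names parse_interval and is_expired are assumptions made for this example, not an existing Solr setting or API.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical unit suffixes: seconds, minutes, hours, days.
_UNIT_SECONDS = {"s": 1, "m": 60, "h": 3600, "d": 86400}


def parse_interval(text: str) -> timedelta:
    """Turn a string such as '90m' or '3d' into a timedelta."""
    text = text.strip().lower()
    number, unit = text[:-1], text[-1:]
    if unit not in _UNIT_SECONDS or not number.isdecimal():
        raise ValueError(f"unrecognized interval: {text!r}")
    return timedelta(seconds=int(number) * _UNIT_SECONDS[unit])


def is_expired(written_at: datetime, ttl: timedelta, now: datetime) -> bool:
    """True once the info written at import start is at least ttl old."""
    return now - written_at >= ttl


# Example: an import that started two days ago, with a three-day expiration,
# should still have its routing info honored.
started = datetime(2017, 12, 3, 18, 0, tzinfo=timezone.utc)
checked = datetime(2017, 12, 5, 18, 0, tzinfo=timezone.utc)
still_valid = not is_expired(started, parse_interval("3d"), checked)
```

With day-sized units available, a multi-day import can be covered without having to express the interval in seconds.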


> DataImportHandler status command in SolrCloud does not work properly 
> ---------------------------------------------------------------------
>
>                 Key: SOLR-3666
>                 URL: https://issues.apache.org/jira/browse/SOLR-3666
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - DataImportHandler, SolrCloud
>    Affects Versions: 4.0-ALPHA
>            Reporter: Sauvik Sarkar
>
> The dataimport?command=status command does not work correctly when invoked on the node not running the DIH in a SolrCloud configuration.
> The expectation is that no matter which node is importing, any other node should be able to get the import status information.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)