Posted to dev@nutch.apache.org by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/11/04 10:33:00 UTC
[jira] [Updated] (NUTCH-2751) nutch clean does not work with secured solr cloud
[ https://issues.apache.org/jira/browse/NUTCH-2751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel updated NUTCH-2751:
-----------------------------------
Fix Version/s: 1.17
> nutch clean does not work with secured solr cloud
> -------------------------------------------------
>
> Key: NUTCH-2751
> URL: https://issues.apache.org/jira/browse/NUTCH-2751
> Project: Nutch
> Issue Type: Bug
> Components: indexer
> Affects Versions: 1.16
> Reporter: Daniel Hammling
> Priority: Critical
> Fix For: 1.17
>
>
> I am calling nutch clean to remove 404 entries from Solr, but it fails with the exception below.
> Adding and updating entries works fine, so the index-writer configuration seems to be correct in general.
> The behaviour is identical in 1.15 and 1.16, although SolrIndexWriter.java has been modified for the delete case.
> I have no more ideas where to look.
>
> 2019-11-01 14:45:55,664 INFO solr.SolrIndexWriter - SolrIndexer: deleting 14/14 documents
> 2019-11-01 14:45:55,768 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> , retry? 0
> 2019-11-01 14:45:55,780 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> , retry? 1
> 2019-11-01 14:45:55,858 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> , retry? 2
> 2019-11-01 14:45:55,887 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> , retry? 3
> 2019-11-01 14:45:55,903 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> , retry? 4
> 2019-11-01 14:45:55,938 ERROR impl.CloudSolrClient - Request to collection [www-int.replaceddomain.de] failed due to (0) org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> , retry? 5
> 2019-11-01 14:45:55,938 DEBUG concurrent.ExecutorHelper - afterExecute in thread: pool-4-thread-1, runnable type: java.util.concurrent.FutureTask
> 2019-11-01 14:45:55,940 INFO mapred.LocalJobRunner - reduce task executor complete.
> 2019-11-01 14:45:55,941 WARN mapred.LocalJobRunner - job_local2086525572_0001
> java.lang.Exception: org.apache.solr.client.solrj.impl.CloudSolrClient$RouteException: IOException occured when talking to server at: http://10.10.0.96:10983/solr/www-int.replaceddomain.de_shard1_replica_n5
> at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:491)
> at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:558)
> Caused by: org.apache.solr.client.solrj.impl.CloudSolrClient$RouteException: IOException occured when talking to server at: http://10.10.0.96:10983/solr/www-int.replaceddomain.de_shard1_replica_n5
> at org.apache.solr.client.solrj.impl.CloudSolrClient.directUpdate(CloudSolrClient.java:553)
> at org.apache.solr.client.solrj.impl.CloudSolrClient.sendRequest(CloudSolrClient.java:1014)
> at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:885)
> at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
> at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
> at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
> at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
> at org.apache.solr.client.solrj.impl.CloudSolrClient.requestWithRetryOnStaleState(CloudSolrClient.java:947)
> at org.apache.solr.client.solrj.impl.CloudSolrClient.request(CloudSolrClient.java:818)
> at org.apache.solr.client.solrj.SolrClient.request(SolrClient.java:1219)
> at org.apache.nutch.indexwriter.solr.SolrIndexWriter.push(SolrIndexWriter.java:270)
> at org.apache.nutch.indexwriter.solr.SolrIndexWriter.commit(SolrIndexWriter.java:214)
> at org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:205)
> at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:257)
> at org.apache.nutch.indexer.CleaningJob$DeleterReducer.cleanup(CleaningJob.java:115)
> at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:179)
> at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:627)
> at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:389)
> at org.apache.hadoop.mapred.LocalJobRunner$Job$ReduceTaskRunnable.run(LocalJobRunner.java:346)
> at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> at java.lang.Thread.run(Thread.java:748)
> Caused by: org.apache.solr.client.solrj.SolrServerException: IOException occured when talking to server at: http://10.10.0.96:10983/solr/www-int.replaceddomain.de_shard1_replica_n5
> at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:657)
> at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:255)
> at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:244)
> at org.apache.solr.client.solrj.impl.LBHttpSolrClient.doRequest(LBHttpSolrClient.java:483)
> at org.apache.solr.client.solrj.impl.LBHttpSolrClient.request(LBHttpSolrClient.java:413)
> at org.apache.solr.client.solrj.impl.CloudSolrClient.lambda$directUpdate$0(CloudSolrClient.java:528)
> at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> at org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:188)
> ... 3 more
> Caused by: org.apache.http.client.ClientProtocolException
> at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:187)
> at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:83)
> at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:56)
> at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:542)
> ... 10 more
> Caused by: org.apache.http.client.NonRepeatableRequestException: Cannot retry request with a non-repeatable request entity.
> at org.apache.http.impl.execchain.MainClientExec.execute(MainClientExec.java:226)
> at org.apache.http.impl.execchain.ProtocolExec.execute(ProtocolExec.java:185)
> at org.apache.http.impl.execchain.RetryExec.execute(RetryExec.java:89)
> at org.apache.http.impl.execchain.RedirectExec.execute(RedirectExec.java:111)
> at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:185)
> ... 13 more
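The innermost cause in the trace above is HttpClient refusing to resend a request whose body can only be read once. On a secured server, one plausible trigger is an authentication challenge after the first attempt, though the trace alone does not confirm that. The sketch below uses only the JDK to illustrate the general idea of a repeatable versus a non-repeatable body; it is not Nutch, SolrJ, or HttpClient code, and the class name, method names, and payload are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.function.Supplier;

// Hypothetical illustration only: plain JDK streams stand in for an HTTP request body.
class RepeatableBodyDemo {

    // Drains the stream the way a client would when writing a request body.
    static String send(InputStream body) {
        try {
            return new String(body.readAllBytes(), StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Non-repeatable: both attempts share one stream, so the retry finds it already drained.
    static String[] oneShotAttempts(byte[] payload) {
        InputStream oneShot = new ByteArrayInputStream(payload);
        String first = send(oneShot);
        String second = send(oneShot); // nothing left to read
        return new String[] { first, second };
    }

    // Repeatable: every attempt asks the supplier for a fresh stream over the same bytes.
    static String[] repeatableAttempts(byte[] payload) {
        Supplier<InputStream> body = () -> new ByteArrayInputStream(payload);
        String first = send(body.get());
        String second = send(body.get());
        return new String[] { first, second };
    }

    public static void main(String[] args) {
        // Assumed example payload, not taken from the log.
        byte[] payload = "<delete><id>doc-1</id></delete>".getBytes(StandardCharsets.UTF_8);

        String[] oneShot = oneShotAttempts(payload);
        System.out.println("one-shot retry is empty: " + oneShot[1].isEmpty());

        String[] repeatable = repeatableAttempts(payload);
        System.out.println("repeatable retry matches: " + repeatable[0].equals(repeatable[1]));
    }
}
```

A client that must be able to resend a body either buffers it, as in `repeatableAttempts`, or avoids the retry altogether, for example by supplying credentials with the first request. Which of these applies to the delete path in SolrIndexWriter is not determined by this message.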
--
This message was sent by Atlassian Jira
(v8.3.4#803005)