You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/01 23:30:52 UTC
[jira] [Commented] (NUTCH-1348) Solrindexer fails with a
java.io.IOException error.
[ https://issues.apache.org/jira/browse/NUTCH-1348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13266135#comment-13266135 ]
Markus Jelsma commented on NUTCH-1348:
--------------------------------------
This is a socket error but the request is retried so the job shouldn't fail completely. Or does it?
> Solrindexer fails with a java.io.IOException error.
> ---------------------------------------------------
>
> Key: NUTCH-1348
> URL: https://issues.apache.org/jira/browse/NUTCH-1348
> Project: Nutch
> Issue Type: Bug
> Components: indexer
> Affects Versions: 1.5
> Environment: Debian Stable AMD64
> Reporter: Christian Johnsson
>
> I'm unable to reproduce this error but it happens from time to time when i run solrindexer.
> I use the same commands as i did with 1.4 and about the same configuration and i haven't changed any solr settings.
> Have the same plugins active just to be able to compare.
> From time to time the solrindexer throws an error. It happends like 1-2 times out of 5 and there is no information in the solr log about it.
> Not sure if it's a bug but i though i might as well report it since i've been running 1.4 since it was released and never came across this error in that version.
> 2012-05-01 20:44:14,861 INFO httpclient.HttpMethodDirector - I/O exception (java.net.SocketException) caught when processing request: Connection reset
> 2012-05-01 20:44:14,861 INFO httpclient.HttpMethodDirector - Retrying request
> 2012-05-01 20:44:15,808 INFO solr.SolrWriter - Indexing 250 documents
> 2012-05-01 20:44:36,153 WARN mapred.LocalJobRunner - job_local_0001
> java.io.IOException
> at org.apache.nutch.indexer.solr.SolrWriter.makeIOException(SolrWriter.java:152)
> at org.apache.nutch.indexer.solr.SolrWriter.write(SolrWriter.java:126)
> at org.apache.nutch.indexer.IndexerOutputFormat$1.write(IndexerOutputFormat.java:55)
> at org.apache.nutch.indexer.IndexerOutputFormat$1.write(IndexerOutputFormat.java:44)
> at org.apache.hadoop.mapred.ReduceTask$3.collect(ReduceTask.java:440)
> at org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:195)
> at org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:51)
> at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:463)
> at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
> at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)
> Caused by: org.apache.solr.client.solrj.SolrServerException: org.apache.commons.httpclient.ProtocolException: Unbuffered entity enclosing request can not be repeated.
> at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:475)
> at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
> at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
> at org.apache.nutch.indexer.solr.SolrWriter.write(SolrWriter.java:124)
> ... 8 more
> Caused by: org.apache.commons.httpclient.ProtocolException: Unbuffered entity enclosing request can not be repeated.
> at org.apache.commons.httpclient.methods.EntityEnclosingMethod.writeRequestBody(EntityEnclosingMethod.java:487)
> at org.apache.commons.httpclient.HttpMethodBase.writeRequest(HttpMethodBase.java:2114)
> at org.apache.commons.httpclient.HttpMethodBase.execute(HttpMethodBase.java:1096)
> at org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:398)
> at org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)
> at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
> at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
> at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:422)
> ... 11 more
> 2012-05-01 20:44:37,074 ERROR solr.SolrIndexer - java.io.IOException: Job failed!
> It's running on a single machine and no hadoop.
> It's indexing around 50-80 000 smaller documents. Worked flawless in 1.4
> Thats about it :-)
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira