Posted to user@nutch.apache.org by blunderboy <sa...@gmail.com> on 2012/03/20 11:56:20 UTC

Job failed while creating SolrIndex

Hi all,
After crawling the site, I tried to create a Solr index, but I get the
following error:

$ bin/nutch solrindex http://localhost:8983/solr/ crawl_20Mar/crawldb/ crawl_20Mar/segments/*

cygpath: can't convert empty path
SolrIndexer: starting at 2012-03-20 16:19:04
Adding 5 documents
Adding 5 documents
java.io.IOException: Job failed!

The log file ends with the following, which I think is the cause of the
error:

2012-03-20 16:19:12,490 INFO  solr.SolrWriter - Adding 5 documents
2012-03-20 16:19:13,650 INFO  httpclient.HttpMethodDirector - I/O exception (java.net.ConnectException) caught when processing request: Connection refused: connect
2012-03-20 16:19:13,650 INFO  httpclient.HttpMethodDirector - Retrying request
2012-03-20 16:19:14,651 INFO  httpclient.HttpMethodDirector - I/O exception (java.net.ConnectException) caught when processing request: Connection refused: connect
2012-03-20 16:19:14,652 INFO  httpclient.HttpMethodDirector - Retrying request
2012-03-20 16:19:15,653 INFO  httpclient.HttpMethodDirector - I/O exception (java.net.ConnectException) caught when processing request: Connection refused: connect
2012-03-20 16:19:15,653 INFO  httpclient.HttpMethodDirector - Retrying request
2012-03-20 16:19:16,654 INFO  solr.SolrWriter - Adding 5 documents
2012-03-20 16:19:17,655 INFO  httpclient.HttpMethodDirector - I/O exception (java.net.ConnectException) caught when processing request: Connection refused: connect
2012-03-20 16:19:17,655 INFO  httpclient.HttpMethodDirector - Retrying request
2012-03-20 16:19:18,662 INFO  httpclient.HttpMethodDirector - I/O exception (java.net.ConnectException) caught when processing request: Connection refused: connect
2012-03-20 16:19:18,662 INFO  httpclient.HttpMethodDirector - Retrying request
2012-03-20 16:19:19,666 INFO  httpclient.HttpMethodDirector - I/O exception (java.net.ConnectException) caught when processing request: Connection refused: connect
2012-03-20 16:19:19,666 INFO  httpclient.HttpMethodDirector - Retrying request
2012-03-20 16:19:20,715 WARN  mapred.LocalJobRunner - job_local_0001
java.io.IOException
	at org.apache.nutch.indexer.solr.SolrWriter.makeIOException(SolrWriter.java:103)
	at org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:98)
	at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:48)
	at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:474)
	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)
Caused by: org.apache.solr.client.solrj.SolrServerException: java.net.ConnectException: Connection refused: connect
	at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:478)
	at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
	at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
	at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:49)
	at org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:93)
	... 4 more
Caused by: java.net.ConnectException: Connection refused: connect
	at java.net.DualStackPlainSocketImpl.connect0(Native Method)
	at java.net.DualStackPlainSocketImpl.socketConnect(Unknown Source)
	at java.net.AbstractPlainSocketImpl.doConnect(Unknown Source)
	at java.net.AbstractPlainSocketImpl.connectToAddress(Unknown Source)
	at java.net.AbstractPlainSocketImpl.connect(Unknown Source)
	at java.net.PlainSocketImpl.connect(Unknown Source)
	at java.net.SocksSocketImpl.connect(Unknown Source)
	at java.net.Socket.connect(Unknown Source)
	at java.net.Socket.connect(Unknown Source)
	at java.net.Socket.<init>(Unknown Source)
	at java.net.Socket.<init>(Unknown Source)
	at org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:80)
	at org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:122)
	at org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:707)
	at org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:387)
	at org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)
	at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
	at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
	at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:422)
	... 8 more
2012-03-20 16:19:21,319 ERROR solr.SolrIndexer - java.io.IOException: Job failed!

Can somebody please tell me what the error is? I have been trying for a long
time but could not create a Solr index of the metadata in the crawled HTML docs.

--
View this message in context: http://lucene.472066.n3.nabble.com/Job-failed-while-creating-SolrIndex-tp3841958p3841958.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Re: Job failed while creating SolrIndex

Posted by Markus Jelsma <ma...@openindex.io>.
 Your answer is in the stack trace:
 Connection refused
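
 "Connection refused" means that nothing accepted the TCP connection at the
 host and port in the Solr URL, so the indexer never reached Solr at all. One
 way to confirm this before rerunning solrindex is the small Python check
 sketched here. The helper name solr_reachable and the 3-second timeout are
 illustrative assumptions, not part of Nutch or Solr; the URL is the one from
 the solrindex command in the original message.

```python
import socket
from urllib.parse import urlsplit


def solr_reachable(solr_url, timeout=3.0):
    """Return True if a TCP connection to the URL's host and port succeeds.

    Hypothetical helper for illustration only; it checks that something is
    listening, not that the listener is actually Solr.
    """
    parts = urlsplit(solr_url)
    host = parts.hostname
    # Fall back to the scheme's default port when the URL does not name one.
    port = parts.port or (443 if parts.scheme == "https" else 80)
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    url = "http://localhost:8983/solr/"  # URL from the solrindex command
    if solr_reachable(url):
        print("Something is listening at", url)
    else:
        print("Nothing accepted the connection at", url)
```

 If the check reports that nothing accepted the connection, the likely causes
 are that Solr is not running or that it listens on a different host or port
 than the one passed to solrindex.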

-- 
 Markus Jelsma - CTO - Openindex
 http://www.linkedin.com/in/markus17
 050-8536600 / 06-50258350