You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by blunderboy <sa...@gmail.com> on 2012/03/20 11:56:20 UTC
Job failed while creating SolrIndex
Hi all,
After crawling the site, I want to create a solrIndex but I am getting the
following error:
*$ bin/nutch solrindex http://localhost:8983/solr/ crawl_20Mar/crawldb/
crawl_20Mar/segments/**
cygpath: can't convert empty path
SolrIndexer: starting at 2012-03-20 16:19:04
Adding 5 documents
Adding 5 documents
java.io.IOException: Job failed!
*the log file has this content at the end which i think is causing the error
:-*
2012-03-20 16:19:12,490 INFO solr.SolrWriter - Adding 5 documents
2012-03-20 16:19:13,650 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.ConnectException) caught when processing request: Connection
refused: connect
2012-03-20 16:19:13,650 INFO httpclient.HttpMethodDirector - Retrying
request
2012-03-20 16:19:14,651 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.ConnectException) caught when processing request: Connection
refused: connect
2012-03-20 16:19:14,652 INFO httpclient.HttpMethodDirector - Retrying
request
2012-03-20 16:19:15,653 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.ConnectException) caught when processing request: Connection
refused: connect
2012-03-20 16:19:15,653 INFO httpclient.HttpMethodDirector - Retrying
request
2012-03-20 16:19:16,654 INFO solr.SolrWriter - Adding 5 documents
2012-03-20 16:19:17,655 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.ConnectException) caught when processing request: Connection
refused: connect
2012-03-20 16:19:17,655 INFO httpclient.HttpMethodDirector - Retrying
request
2012-03-20 16:19:18,662 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.ConnectException) caught when processing request: Connection
refused: connect
2012-03-20 16:19:18,662 INFO httpclient.HttpMethodDirector - Retrying
request
2012-03-20 16:19:19,666 INFO httpclient.HttpMethodDirector - I/O exception
(java.net.ConnectException) caught when processing request: Connection
refused: connect
2012-03-20 16:19:19,666 INFO httpclient.HttpMethodDirector - Retrying
request
2012-03-20 16:19:20,715 WARN mapred.LocalJobRunner - job_local_0001
java.io.IOException
at
org.apache.nutch.indexer.solr.SolrWriter.makeIOException(SolrWriter.java:103)
at org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:98)
at
org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:48)
at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:474)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)
Caused by: org.apache.solr.client.solrj.SolrServerException:
java.net.ConnectException: Connection refused: connect
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:478)
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
at
org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:49)
at org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:93)
... 4 more
Caused by: java.net.ConnectException: Connection refused: connect
at java.net.DualStackPlainSocketImpl.connect0(Native Method)
at java.net.DualStackPlainSocketImpl.socketConnect(Unknown Source)
at java.net.AbstractPlainSocketImpl.doConnect(Unknown Source)
at java.net.AbstractPlainSocketImpl.connectToAddress(Unknown Source)
at java.net.AbstractPlainSocketImpl.connect(Unknown Source)
at java.net.PlainSocketImpl.connect(Unknown Source)
at java.net.SocksSocketImpl.connect(Unknown Source)
at java.net.Socket.connect(Unknown Source)
at java.net.Socket.connect(Unknown Source)
at java.net.Socket.<init>(Unknown Source)
at java.net.Socket.<init>(Unknown Source)
at
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:80)
at
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:122)
at
org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:707)
at
org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:387)
at
org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)
at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:422)
... 8 more
2012-03-20 16:19:21,319 ERROR solr.SolrIndexer - java.io.IOException: Job
failed!
Can somebody please tell me what's error ? I am trying from long time but
could not create solrIndex of meta data in the crawled html docs.
--
View this message in context: http://lucene.472066.n3.nabble.com/Job-failed-while-creating-SolrIndex-tp3841958p3841958.html
Sent from the Nutch - User mailing list archive at Nabble.com.
Re: Job failed while creating SolrIndex
Posted by Markus Jelsma <ma...@openindex.io>.
Your answer is in the stack trace:
Connection refused
On Tue, 20 Mar 2012 03:56:20 -0700 (PDT), blunderboy
<sa...@gmail.com> wrote:
> Hi all,
> After crawling the site, I want to create a solrIndex but I am
> getting the
> following error:
>
> *$ bin/nutch solrindex http://localhost:8983/solr/
> crawl_20Mar/crawldb/
> crawl_20Mar/segments/**
>
> cygpath: can't convert empty path
> SolrIndexer: starting at 2012-03-20 16:19:04
> Adding 5 documents
> Adding 5 documents
> java.io.IOException: Job failed!
>
> *the log file has this content at the end which i think is causing
> the error
> :-*
>
> 2012-03-20 16:19:12,490 INFO solr.SolrWriter - Adding 5 documents
> 2012-03-20 16:19:13,650 INFO httpclient.HttpMethodDirector - I/O
> exception
> (java.net.ConnectException) caught when processing request:
> Connection
> refused: connect
> 2012-03-20 16:19:13,650 INFO httpclient.HttpMethodDirector -
> Retrying
> request
> 2012-03-20 16:19:14,651 INFO httpclient.HttpMethodDirector - I/O
> exception
> (java.net.ConnectException) caught when processing request:
> Connection
> refused: connect
> 2012-03-20 16:19:14,652 INFO httpclient.HttpMethodDirector -
> Retrying
> request
> 2012-03-20 16:19:15,653 INFO httpclient.HttpMethodDirector - I/O
> exception
> (java.net.ConnectException) caught when processing request:
> Connection
> refused: connect
> 2012-03-20 16:19:15,653 INFO httpclient.HttpMethodDirector -
> Retrying
> request
> 2012-03-20 16:19:16,654 INFO solr.SolrWriter - Adding 5 documents
> 2012-03-20 16:19:17,655 INFO httpclient.HttpMethodDirector - I/O
> exception
> (java.net.ConnectException) caught when processing request:
> Connection
> refused: connect
> 2012-03-20 16:19:17,655 INFO httpclient.HttpMethodDirector -
> Retrying
> request
> 2012-03-20 16:19:18,662 INFO httpclient.HttpMethodDirector - I/O
> exception
> (java.net.ConnectException) caught when processing request:
> Connection
> refused: connect
> 2012-03-20 16:19:18,662 INFO httpclient.HttpMethodDirector -
> Retrying
> request
> 2012-03-20 16:19:19,666 INFO httpclient.HttpMethodDirector - I/O
> exception
> (java.net.ConnectException) caught when processing request:
> Connection
> refused: connect
> 2012-03-20 16:19:19,666 INFO httpclient.HttpMethodDirector -
> Retrying
> request
> 2012-03-20 16:19:20,715 WARN mapred.LocalJobRunner - job_local_0001
> java.io.IOException
> at
>
> org.apache.nutch.indexer.solr.SolrWriter.makeIOException(SolrWriter.java:103)
> at
> org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:98)
> at
>
> org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:48)
> at
> org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:474)
> at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
> at
> org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)
> Caused by: org.apache.solr.client.solrj.SolrServerException:
> java.net.ConnectException: Connection refused: connect
> at
>
> org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:478)
> at
>
> org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
> at
>
> org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
> at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:49)
> at
> org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:93)
> ... 4 more
> Caused by: java.net.ConnectException: Connection refused: connect
> at java.net.DualStackPlainSocketImpl.connect0(Native Method)
> at java.net.DualStackPlainSocketImpl.socketConnect(Unknown Source)
> at java.net.AbstractPlainSocketImpl.doConnect(Unknown Source)
> at java.net.AbstractPlainSocketImpl.connectToAddress(Unknown Source)
> at java.net.AbstractPlainSocketImpl.connect(Unknown Source)
> at java.net.PlainSocketImpl.connect(Unknown Source)
> at java.net.SocksSocketImpl.connect(Unknown Source)
> at java.net.Socket.connect(Unknown Source)
> at java.net.Socket.connect(Unknown Source)
> at java.net.Socket.<init>(Unknown Source)
> at java.net.Socket.<init>(Unknown Source)
> at
>
> org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:80)
> at
>
> org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:122)
> at
>
> org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:707)
> at
>
> org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:387)
> at
>
> org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)
> at
>
> org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
> at
>
> org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
> at
>
> org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:422)
> ... 8 more
> 2012-03-20 16:19:21,319 ERROR solr.SolrIndexer - java.io.IOException:
> Job
> failed!
>
> Can somebody please tell me what's error ? I am trying from long time
> but
> could not create solrIndex of meta data in the crawled html docs.
>
> --
> View this message in context:
>
> http://lucene.472066.n3.nabble.com/Job-failed-while-creating-SolrIndex-tp3841958p3841958.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
--
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536600 / 06-50258350