You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Cheng Li <ch...@usc.edu> on 2011/07/24 05:05:40 UTC
Nutch 1.3+solr query question
Hi,
How to make the query result like the hyperlink lists as Nutch 1.2 did
in Nutch-1.3+solr frame ?
I followed the instruction from
http://wiki.apache.org/nutch/RunningNutchAndSolr ,
After inputting the command,
some error messages like this:
SolrIndexer: starting at 2011-07-23 20:02:04
java.io.IOException: Job failed!
SolrDeleteDuplicates: starting at 2011-07-23 20:02:07
SolrDeleteDuplicates: Solr url: http://localhost:8983/solr/
Exception in thread "main" java.io.IOException:
org.apache.solr.client.solrj.SolrServerException: java.net.ConnectException:
Connection refused
at
org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat.getSplits(SolrDeleteDuplicates.java:201)
at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
at
org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
at
org.apache.nutch.indexer.solr.SolrDeleteDuplicates.dedup(SolrDeleteDuplicates.java:362)
at org.apache.nutch.crawl.Crawl.run(Crawl.java:152)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.crawl.Crawl.main(Crawl.java:54)
Caused by: org.apache.solr.client.solrj.SolrServerException:
java.net.ConnectException: Connection refused
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:484)
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:245)
at
org.apache.solr.client.solrj.request.QueryRequest.process(QueryRequest.java:89)
at org.apache.solr.client.solrj.SolrServer.query(SolrServer.java:118)
at
org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat.getSplits(SolrDeleteDuplicates.java:199)
... 8 more
Caused by: java.net.ConnectException: Connection refused
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:351)
at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:213)
at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:200)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
at java.net.Socket.connect(Socket.java:529)
at java.net.Socket.connect(Socket.java:478)
at java.net.Socket.<init>(Socket.java:375)
at java.net.Socket.<init>(Socket.java:249)
at
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:80)
at
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:122)
at
org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:707)
at
org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$HttpConnectionAdapter.open(MultiThreadedHttpConnectionManager.java:1361)
at
org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:387)
at
org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)
at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:428)
... 12 more
how to solve this problem?
thanks,
--
Cheng Li
Re: Nutch 1.3+solr query question
Posted by Markus Jelsma <ma...@openindex.io>.
Your error is not related to queries at all but fails when indexing and
provides a clear error message:
> java.net.ConnectException: Connection refused
Check Nutch and Solr logs.
> Hi,
>
> How to make the query result like the hyperlink lists as Nutch 1.2
> did in Nutch-1.3+solr frame ?
> I followed the instruction from
> http://wiki.apache.org/nutch/RunningNutchAndSolr ,
>
> After inputting the command,
>
> some error messages like this:
>
> SolrIndexer: starting at 2011-07-23 20:02:04
> java.io.IOException: Job failed!
> SolrDeleteDuplicates: starting at 2011-07-23 20:02:07
> SolrDeleteDuplicates: Solr url: http://localhost:8983/solr/
> Exception in thread "main" java.io.IOException:
> org.apache.solr.client.solrj.SolrServerException:
> java.net.ConnectException: Connection refused
> at
> org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat.getSplit
> s(SolrDeleteDuplicates.java:201) at
> org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810) at
> org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
> at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
> at
> org.apache.nutch.indexer.solr.SolrDeleteDuplicates.dedup(SolrDeleteDuplicat
> es.java:362) at org.apache.nutch.crawl.Crawl.run(Crawl.java:152)
> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
> at org.apache.nutch.crawl.Crawl.main(Crawl.java:54)
> Caused by: org.apache.solr.client.solrj.SolrServerException:
> java.net.ConnectException: Connection refused
> at
> org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttp
> SolrServer.java:484) at
> org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttp
> SolrServer.java:245) at
> org.apache.solr.client.solrj.request.QueryRequest.process(QueryRequest.java
> :89) at org.apache.solr.client.solrj.SolrServer.query(SolrServer.java:118)
> at
> org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat.getSplit
> s(SolrDeleteDuplicates.java:199) ... 8 more
> Caused by: java.net.ConnectException: Connection refused
> at java.net.PlainSocketImpl.socketConnect(Native Method)
> at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:351)
> at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:213)
> at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:200)
> at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
> at java.net.Socket.connect(Socket.java:529)
> at java.net.Socket.connect(Socket.java:478)
> at java.net.Socket.<init>(Socket.java:375)
> at java.net.Socket.<init>(Socket.java:249)
> at
> org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createS
> ocket(DefaultProtocolSocketFactory.java:80) at
> org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createS
> ocket(DefaultProtocolSocketFactory.java:122) at
> org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:707)
> at
> org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$HttpConnec
> tionAdapter.open(MultiThreadedHttpConnectionManager.java:1361) at
> org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMetho
> dDirector.java:387) at
> org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDi
> rector.java:171) at
> org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
> at
> org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
> at
> org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttp
> SolrServer.java:428) ... 12 more
>
>
>
> how to solve this problem?
>
>
> thanks,