You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Cheng Li <ch...@usc.edu> on 2011/07/24 05:05:40 UTC

Nutch 1.3+solr query question

Hi,

      How to make the query result like the hyperlink lists as Nutch 1.2 did
in Nutch-1.3+solr frame ?
     I followed the instruction from
http://wiki.apache.org/nutch/RunningNutchAndSolr ,

    After inputting the command,

    some error messages like this:

            SolrIndexer: starting at 2011-07-23 20:02:04
java.io.IOException: Job failed!
SolrDeleteDuplicates: starting at 2011-07-23 20:02:07
SolrDeleteDuplicates: Solr url: http://localhost:8983/solr/
Exception in thread "main" java.io.IOException:
org.apache.solr.client.solrj.SolrServerException: java.net.ConnectException:
Connection refused
    at
org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat.getSplits(SolrDeleteDuplicates.java:201)
    at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
    at
org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
    at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
    at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
    at
org.apache.nutch.indexer.solr.SolrDeleteDuplicates.dedup(SolrDeleteDuplicates.java:362)
    at org.apache.nutch.crawl.Crawl.run(Crawl.java:152)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.nutch.crawl.Crawl.main(Crawl.java:54)
Caused by: org.apache.solr.client.solrj.SolrServerException:
java.net.ConnectException: Connection refused
    at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:484)
    at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:245)
    at
org.apache.solr.client.solrj.request.QueryRequest.process(QueryRequest.java:89)
    at org.apache.solr.client.solrj.SolrServer.query(SolrServer.java:118)
    at
org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat.getSplits(SolrDeleteDuplicates.java:199)
    ... 8 more
Caused by: java.net.ConnectException: Connection refused
    at java.net.PlainSocketImpl.socketConnect(Native Method)
    at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:351)
    at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:213)
    at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:200)
    at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
    at java.net.Socket.connect(Socket.java:529)
    at java.net.Socket.connect(Socket.java:478)
    at java.net.Socket.<init>(Socket.java:375)
    at java.net.Socket.<init>(Socket.java:249)
    at
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:80)
    at
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:122)
    at
org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:707)
    at
org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$HttpConnectionAdapter.open(MultiThreadedHttpConnectionManager.java:1361)
    at
org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:387)
    at
org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)
    at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
    at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
    at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:428)
    ... 12 more



  how to solve this problem?


thanks,
-- 
Cheng Li

Re: Nutch 1.3+solr query question

Posted by Markus Jelsma <ma...@openindex.io>.
Your error is not related to queries at all but fails when indexing and 
provides a clear error message:

> java.net.ConnectException: Connection refused

Check Nutch and Solr logs.

> Hi,
> 
>       How to make the query result like the hyperlink lists as Nutch 1.2
> did in Nutch-1.3+solr frame ?
>      I followed the instruction from
> http://wiki.apache.org/nutch/RunningNutchAndSolr ,
> 
>     After inputting the command,
> 
>     some error messages like this:
> 
>             SolrIndexer: starting at 2011-07-23 20:02:04
> java.io.IOException: Job failed!
> SolrDeleteDuplicates: starting at 2011-07-23 20:02:07
> SolrDeleteDuplicates: Solr url: http://localhost:8983/solr/
> Exception in thread "main" java.io.IOException:
> org.apache.solr.client.solrj.SolrServerException:
> java.net.ConnectException: Connection refused
>     at
> org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat.getSplit
> s(SolrDeleteDuplicates.java:201) at
> org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810) at
> org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
>     at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
>     at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
>     at
> org.apache.nutch.indexer.solr.SolrDeleteDuplicates.dedup(SolrDeleteDuplicat
> es.java:362) at org.apache.nutch.crawl.Crawl.run(Crawl.java:152)
>     at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>     at org.apache.nutch.crawl.Crawl.main(Crawl.java:54)
> Caused by: org.apache.solr.client.solrj.SolrServerException:
> java.net.ConnectException: Connection refused
>     at
> org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttp
> SolrServer.java:484) at
> org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttp
> SolrServer.java:245) at
> org.apache.solr.client.solrj.request.QueryRequest.process(QueryRequest.java
> :89) at org.apache.solr.client.solrj.SolrServer.query(SolrServer.java:118)
> at
> org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat.getSplit
> s(SolrDeleteDuplicates.java:199) ... 8 more
> Caused by: java.net.ConnectException: Connection refused
>     at java.net.PlainSocketImpl.socketConnect(Native Method)
>     at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:351)
>     at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:213)
>     at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:200)
>     at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
>     at java.net.Socket.connect(Socket.java:529)
>     at java.net.Socket.connect(Socket.java:478)
>     at java.net.Socket.<init>(Socket.java:375)
>     at java.net.Socket.<init>(Socket.java:249)
>     at
> org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createS
> ocket(DefaultProtocolSocketFactory.java:80) at
> org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createS
> ocket(DefaultProtocolSocketFactory.java:122) at
> org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:707)
>     at
> org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$HttpConnec
> tionAdapter.open(MultiThreadedHttpConnectionManager.java:1361) at
> org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMetho
> dDirector.java:387) at
> org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDi
> rector.java:171) at
> org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
>     at
> org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
>     at
> org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttp
> SolrServer.java:428) ... 12 more
> 
> 
> 
>   how to solve this problem?
> 
> 
> thanks,