You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by DANIEL CLARK <da...@verizon.net> on 2007/06/29 22:07:10 UTC

NoRouteToHostException

I'm running 0.8.1 and I'm getting the following exception.  Any help would be appreciated.

$ bin/nutch crawl urls -dir crawl -depth 3
crawl started in: crawl
rootUrlDir = urls
threads = 10
depth = 3
Injector: starting
Injector: crawlDb: crawl/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Exception in thread "main" java.net.NoRouteToHostException: No route to host
        at java.net.PlainSocketImpl.socketConnect(Native Method)
        at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
        at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
        at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
        at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
        at java.net.Socket.connect(Socket.java:519)
        at java.net.Socket.connect(Socket.java:469)
        at java.net.Socket.<init>(Socket.java:366)
        at java.net.Socket.<init>(Socket.java:208)
        at org.apache.hadoop.ipc.Client$Connection.<init>(Client.java:113)
        at org.apache.hadoop.ipc.Client.getConnection(Client.java:359)
        at org.apache.hadoop.ipc.Client.call(Client.java:297)
        at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:150)
        at org.apache.hadoop.mapred.$Proxy1.getFilesystemName(Unknown Source)
        at org.apache.hadoop.mapred.JobClient.getFs(JobClient.java:214)
        at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:248)
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
        at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)

 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Daniel Clark, President
DAC Systems, Inc.
5209 Nanticoke Court
Centreville, VA  20120
Cell - (703) 403-0340
Email - daniel.a.clark@verizon.net
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Re: NoRouteToHostException

Posted by Ian Holsman <li...@holsman.net>.
FYI this was resolved to be a network configuration/firewall issue and 
had nothing to do with nutch or hadoop.


Tsengtan A Shuy wrote:
> I got the same error when I ran in my cygwin environment.
> So I ran it in the windows eclipse environment, it ran OK but I still have
> some other nutch-0.9 issue to deal with.
> Please read the following web page:
> http://wiki.apache.org/nutch/RunNutchInEclipse, and 
> http://lucene.apache.org/nutch/tutorial8.html 
> Then ran it again.
>
> Adam Shuy, President
> ePacific Web Design & Hosting
> Professional Web/Software developer
> TEL: 408-272-6946
> www.epacificweb.com
> -----Original Message-----
> From: DANIEL CLARK [mailto:daniel.a.clark@verizon.net] 
> Sent: Friday, June 29, 2007 1:07 PM
> To: Nutch List
> Subject: NoRouteToHostException
>
> I'm running 0.8.1 and I'm getting the following exception.  Any help would
> be appreciated.
>
> $ bin/nutch crawl urls -dir crawl -depth 3
> crawl started in: crawl
> rootUrlDir = urls
> threads = 10
> depth = 3
> Injector: starting
> Injector: crawlDb: crawl/crawldb
> Injector: urlDir: urls
> Injector: Converting injected urls to crawl db entries.
> Exception in thread "main" java.net.NoRouteToHostException: No route to host
>         at java.net.PlainSocketImpl.socketConnect(Native Method)
>         at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
>         at
> java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
>         at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
>         at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
>         at java.net.Socket.connect(Socket.java:519)
>         at java.net.Socket.connect(Socket.java:469)
>         at java.net.Socket.<init>(Socket.java:366)
>         at java.net.Socket.<init>(Socket.java:208)
>         at org.apache.hadoop.ipc.Client$Connection.<init>(Client.java:113)
>         at org.apache.hadoop.ipc.Client.getConnection(Client.java:359)
>         at org.apache.hadoop.ipc.Client.call(Client.java:297)
>         at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:150)
>         at org.apache.hadoop.mapred.$Proxy1.getFilesystemName(Unknown
> Source)
>         at org.apache.hadoop.mapred.JobClient.getFs(JobClient.java:214)
>         at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:248)
>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
>         at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
>         at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)
>
>  
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> Daniel Clark, President
> DAC Systems, Inc.
> 5209 Nanticoke Court
> Centreville, VA  20120
> Cell - (703) 403-0340
> Email - daniel.a.clark@verizon.net
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>
>
>   



RE: NoRouteToHostException

Posted by Tsengtan A Shuy <tt...@sbcglobal.net>.
I got the same error when I ran in my cygwin environment.
So I ran it in the windows eclipse environment, it ran OK but I still have
some other nutch-0.9 issue to deal with.
Please read the following web page:
http://wiki.apache.org/nutch/RunNutchInEclipse, and 
http://lucene.apache.org/nutch/tutorial8.html 
Then ran it again.

Adam Shuy, President
ePacific Web Design & Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com
-----Original Message-----
From: DANIEL CLARK [mailto:daniel.a.clark@verizon.net] 
Sent: Friday, June 29, 2007 1:07 PM
To: Nutch List
Subject: NoRouteToHostException

I'm running 0.8.1 and I'm getting the following exception.  Any help would
be appreciated.

$ bin/nutch crawl urls -dir crawl -depth 3
crawl started in: crawl
rootUrlDir = urls
threads = 10
depth = 3
Injector: starting
Injector: crawlDb: crawl/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Exception in thread "main" java.net.NoRouteToHostException: No route to host
        at java.net.PlainSocketImpl.socketConnect(Native Method)
        at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
        at
java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
        at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
        at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
        at java.net.Socket.connect(Socket.java:519)
        at java.net.Socket.connect(Socket.java:469)
        at java.net.Socket.<init>(Socket.java:366)
        at java.net.Socket.<init>(Socket.java:208)
        at org.apache.hadoop.ipc.Client$Connection.<init>(Client.java:113)
        at org.apache.hadoop.ipc.Client.getConnection(Client.java:359)
        at org.apache.hadoop.ipc.Client.call(Client.java:297)
        at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:150)
        at org.apache.hadoop.mapred.$Proxy1.getFilesystemName(Unknown
Source)
        at org.apache.hadoop.mapred.JobClient.getFs(JobClient.java:214)
        at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:248)
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
        at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)

 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Daniel Clark, President
DAC Systems, Inc.
5209 Nanticoke Court
Centreville, VA  20120
Cell - (703) 403-0340
Email - daniel.a.clark@verizon.net
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~