Posted to user@nutch.apache.org by Dmitry Glussky <gd...@tut.by> on 2007/09/17 18:17:00 UTC
range of IP's using smb protocol
Hi,
I'm using Nutch to crawl Microsoft Windows shares over the smb
protocol. I can crawl my local network if I list the URL of every
computer on the network in my "urls" file. The file looks like this:
smb://comp1/
smb://comp2/
..
Is it possible to put a range of IP addresses in this file? I mean
something like this:
smb://192.168.18.*/ or smb://192.168.18.1/ - smb://192.168.18.255/ or
anything else.
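The only workaround I can think of is to generate the "urls" file with a script, one line per address. Below is a rough sketch in Python, assuming a contiguous IPv4 range. The function names smb_seed_urls and write_seed_file and the output file name are my own inventions for illustration, not anything provided by Nutch or jcifs:

```python
import ipaddress


def smb_seed_urls(first, last):
    """Yield one smb:// URL per IPv4 address from first to last, inclusive.

    Hypothetical helper: it only formats strings and does not check
    whether a host actually answers at each address.
    """
    start = int(ipaddress.IPv4Address(first))
    end = int(ipaddress.IPv4Address(last))
    if start > end:
        raise ValueError("first address must not be greater than last")
    for n in range(start, end + 1):
        yield f"smb://{ipaddress.IPv4Address(n)}/"


def write_seed_file(path, first, last):
    """Write the generated URLs to path, one per line."""
    with open(path, "w", encoding="ascii") as f:
        for url in smb_seed_urls(first, last):
            f.write(url + "\n")


if __name__ == "__main__":
    # Assumption: a /24 network, so .255 (normally the broadcast
    # address) is left out of the generated list.
    write_seed_file("urls", "192.168.18.1", "192.168.18.254")
```

This would still make Nutch try every address in the range, including ones with no machine behind them, so I would prefer a built-in way if one exists.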
I also tried my network's name (smb://network_name/), since the jcifs
documentation says it is a valid URL:
http://jcifs.samba.org/src/docs/api/jcifs/smb/SmbFile.html
But Nutch fails with java.net.UnknownHostException:
fetch of smb://werewolf/ failed with: jcifs.smb.SmbException: smb://werewolf/
java.net.UnknownHostException: werewolf
    at jcifs.UniAddress.getByName(UniAddress.java:301)
    at jcifs.smb.SmbFile.getAddress(SmbFile.java:765)
    at jcifs.smb.SmbFile.getType(SmbFile.java:1171)
    at jcifs.smb.SmbFile.exists(SmbFile.java:1282)
    at org.apache.nutch.protocol.smb.SMBResponse.<init>(SMBResponse.java:94)
    at org.apache.nutch.protocol.smb.SMB.getProtocolOutput(SMB.java:65)
    at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:145)
Thank you.