You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Noah Silverman <no...@smartmediacorp.com> on 2009/12/17 07:17:45 UTC
Customize crawl
Hi,
More questions about Nutch.
I have a list of 1000 URLs that I want to crawl and index. Our plan is
to check the same sites often for updates and/or new content. How would
you suggest configuring Nutch for this?
Or, more generally, is there good source of documentation for all of
this? What I found on the Nutch website seems a little thin.
-N