You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Noah Silverman <no...@smartmediacorp.com> on 2009/12/17 07:17:45 UTC

Customize crawl

Hi,

More questions about Nutch.

I have a list of 1000 URLs that I want to crawl and index.  Our plan is 
to check the same sites often for updates and/or new content.  How would 
you suggest configuring Nutch for this?

Or, more generally, is there good source of documentation for all of 
this?  What I found on the Nutch website seems a little thin.

-N