You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by og...@yahoo.com on 2008/06/06 18:12:36 UTC

Re: recrawl in 1.0

Please use nutch-user list instead of nutch-dev.  I'm replying there.  Nutch uses an adaptive fetch interval by observing how often a page changes and setting the "next fetch date" based on that.

 Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch


----- Original Message ----
> From: scottyd <sc...@homepagesdirectories.com>
> To: nutch-dev@lucene.apache.org
> Sent: Thursday, June 5, 2008 2:44:21 PM
> Subject: recrawl in 1.0
> 
> 
> I was wondering how to accomplish a recrawl in the trunk release of nutch.
> 
> I've read through some other posts and I noticed that a lot of it was about
> creating a script to run, but that the script is for 8 and not anything new.
> 
> also, is nutch set by default to do a recrawl after x amount of time?
> 
> i am new to nutch as well as java and tomcat so any help is greatly
> appreciated.
> thanks!
> 
> 
> 
> -- 
> View this message in context: 
> http://www.nabble.com/recrawl-in-1.0-tp17676943p17676943.html
> Sent from the Nutch - Dev mailing list archive at Nabble.com.