You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by og...@yahoo.com on 2008/06/06 18:12:36 UTC
Re: recrawl in 1.0
Please use nutch-user list instead of nutch-dev. I'm replying there. Nutch uses an adaptive fetch interval by observing how often a page changes and setting the "next fetch date" based on that.
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
----- Original Message ----
> From: scottyd <sc...@homepagesdirectories.com>
> To: nutch-dev@lucene.apache.org
> Sent: Thursday, June 5, 2008 2:44:21 PM
> Subject: recrawl in 1.0
>
>
> I was wondering how to accomplish a recrawl in the trunk release of nutch.
>
> I've read through some other posts and I noticed that a lot of it was about
> creating a script to run, but that the script is for 8 and not anything new.
>
> also, is nutch set by default to do a recrawl after x amount of time?
>
> i am new to nutch as well as java and tomcat so any help is greatly
> appreciated.
> thanks!
>
>
>
> --
> View this message in context:
> http://www.nabble.com/recrawl-in-1.0-tp17676943p17676943.html
> Sent from the Nutch - Dev mailing list archive at Nabble.com.