You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Stefan Scheffler <ss...@avantgarde-labs.de> on 2012/08/17 11:47:06 UTC

recrawling

Hello,
How can i do a recrawling to an existing crawldb?

With friendly regards
Stefan Scheffler

-- 
Stefan Scheffler
Avantgarde Labs GmbH
Löbauer Straße 19, 01099 Dresden
Telefon: + 49 (0) 351 21590834
Email: sscheffler@avantgarde-labs.de


RE: recrawling

Posted by Markus Jelsma <ma...@openindex.io>.
hi,

Pages will be recrawled when their eligible (last fetch time + interval). To force it you can use the -adddays switch on the generator tool. 


 
 
-----Original message-----
> From:Stefan Scheffler <ss...@avantgarde-labs.de>
> Sent: Fri 17-Aug-2012 11:54
> To: user@nutch.apache.org
> Subject: recrawling
> 
> Hello,
> How can i do a recrawling to an existing crawldb?
> 
> With friendly regards
> Stefan Scheffler
> 
> -- 
> Stefan Scheffler
> Avantgarde Labs GmbH
> Löbauer Straße 19, 01099 Dresden
> Telefon: + 49 (0) 351 21590834
> Email: sscheffler@avantgarde-labs.de
> 
>