You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by remi tassing <ta...@gmail.com> on 2012/01/29 08:10:23 UTC

undo "db_gone"

Hi,

I understand when a url is classified as "db_gone", Nutch won't bother
fetch it again. I have many urls in this situation that I would like to
recrawl.

Any idea how to fix it?

Remi

Re: undo "db_gone"

Posted by remi tassing <ta...@gmail.com>.
I'm using Solr-3.4.

I honestly didn't get that message Mark

Remi

On Sunday, January 29, 2012, Markus Jelsma <ma...@openindex.io>
wrote:
> In trunk you can use generate.restrict.status to generate records for that
> status.
>
>> Hi,
>>
>> I understand when a url is classified as "db_gone", Nutch won't bother
>> fetch it again. I have many urls in this situation that I would like to
>> recrawl.
>>
>> Any idea how to fix it?
>>
>> Remi
>

Re: undo "db_gone"

Posted by Markus Jelsma <ma...@openindex.io>.
In trunk you can use generate.restrict.status to generate records for that 
status.

> Hi,
> 
> I understand when a url is classified as "db_gone", Nutch won't bother
> fetch it again. I have many urls in this situation that I would like to
> recrawl.
> 
> Any idea how to fix it?
> 
> Remi