You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Eyeris Rodriguez Rueda <er...@uci.cu> on 2016/10/25 16:32:45 UTC
generator conditional by crawldb status
Hi all.
I am using nutch 1.12 and solr 4.10.3 with linuxmint 18.
I want to crawl pages from crawldb using this order.
1-unfetched
2-modified
3-gone
and others
I know that generator process is which decides what pages are selected or not from crawldb.
Any help or advice to crawl pages in that order will be appreciated.
Greetings.