You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by mistapony <ch...@gmail.com> on 2008/01/15 21:58:14 UTC
Re: partial crawling
I am curious about this same question. This looks like a very old thread but
is there a way to force a timeout in the fetch at all?
Thanks.
Sorantis wrote:
>
> Hi All!
> I want to add nutch crawl command to cron.
> There are few thing I need to know.
>
> When I execute ./nutch crawl ..etc. I must wait until it finishes job...
> otherwise segment wouldn't be complete...or would it?
> Once I've canceled crawling; then looked on crawl results -
> Nutch had found nothing on all keywords I had typed..
>
> So I thought that's because of incomplete crawling.
> All I want is to crawl ..lets say 15 mins and after that just stop
> crawling.
> Corespondingly all segments and db must be complete and ready to work..
> Is there any way to do this?
> Thanks!
>
--
View this message in context: http://www.nabble.com/partial-crawling-tp4136889p14846888.html
Sent from the Nutch - User mailing list archive at Nabble.com.