Posted to user@nutch.apache.org by Sagar Naik <sa...@visvo.com> on 2007/10/29 16:53:23 UTC
Re: Crawl Problem
Kunal Wku wrote:
> Hello,
>
> I have a web page with around 300 hyperlinks to other pages. When I run the crawl under Cygwin, only about 80 of the linked pages get crawled. How can I crawl the whole page, i.e., follow all of its hyperlinks?
>
> Thanks & Regards,
> Kunal
>
Hey Kunal,
Have a look at the "db.max.outlinks.per.page" property in the Nutch conf file. It limits how many outlinks are processed for each page.
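A rough sketch of an override, assuming you put it in conf/nutch-site.xml (which takes precedence over conf/nutch-default.xml) and that your version treats a negative value as "no limit". Please check the property's description in your own nutch-default.xml before relying on this:

```xml
<?xml version="1.0"?>
<!-- conf/nutch-site.xml: local overrides of nutch-default.xml -->
<configuration>
  <property>
    <name>db.max.outlinks.per.page</name>
    <!-- Assumption: a negative value means all outlinks are processed.
         A positive number larger than the link count (e.g. 500) is an
         alternative if you prefer to keep an upper bound. -->
    <value>-1</value>
  </property>
</configuration>
```

The outlinks are cut off when a page is parsed, so you will probably need to re-run the crawl for the change to show up.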
- Sagar