Posted to user@nutch.apache.org by obradoa <ao...@gmail.com> on 2008/01/28 06:55:10 UTC

Approaches to limit crawls to English Language or even US sites only

As my index grows, a lot of crawl time is wasted on websites in languages I
do not care about. Is it possible to limit fetches to English-language
websites only, or, even better, to US or US commercial websites only?

I am using Nutch 0.9 at present.

