Posted to user@nutch.apache.org by jotta <so...@gmail.com> on 2011/06/02 12:42:59 UTC

RE: Crawling process - Fetching

Thank you for the advice!

I have probably found the reason for my problem: in regex-urlfilter.txt I had
this line:

 -[?*!@=]

Because of this rule, Nutch skips URLs containing any of these characters.
I have changed the rule and so far there have been no problems with
injecting/fetching :)
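
To illustrate what that character class catches, here is a minimal sketch in
Python (not Nutch's own code). It assumes the filter rejects a URL when the
pattern is found anywhere in it; the function name and the example URLs are
made up for the illustration.

```python
import re

# Character class from the rule quoted above; the leading '-' in
# regex-urlfilter.txt marks it as a "reject" rule, so only the class is compiled.
SKIP_CHARS = re.compile(r"[?*!@=]")


def is_skipped(url: str) -> bool:
    """Return True if the URL contains any character from the class.

    Hypothetical helper: assumes a match anywhere in the URL triggers the rule.
    """
    return SKIP_CHARS.search(url) is not None


# Hypothetical example URLs, not taken from the original crawl.
for url in ("http://example.com/page.html", "http://example.com/list?id=5"):
    print(url, "->", "skipped" if is_skipped(url) else "kept")
```

Under that assumption, any URL with a query string (a '?' or '=') is dropped,
which would explain why such pages were never injected or fetched.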


-----
Regards,
Jotta

PS. Sorry for my English :)
--
View this message in context: http://lucene.472066.n3.nabble.com/Crawling-process-Fetching-tp2873786p3014577.html
Sent from the Nutch - User mailing list archive at Nabble.com.