You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Tomislav Poljak <tp...@gmail.com> on 2007/12/12 13:15:34 UTC

Regex while fetching

Hi,
I am trying to debug following exception:
 
ERROR http.Http - at java.util.regex.Pattern
$Curly.match0(Pattern.java:3773)

this exception occurs a lot while fetching so my question is: why Nutch
uses regex in fetching phase, is it for url filtering? Shouldn't
fetchlist be already filtered after generate phase?

Thanks,
       Tomislav