You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Joshua J Pavel <jp...@us.ibm.com> on 2011/02/02 16:42:00 UTC

Enabling logging breaks parsing?

If I turn on logging for parsing with this line:
log4j.logger.org.apache.nutch.parse.html=TRACE,cmdstdout

I get an error parsing:

fetching http://test.site.com
Parsing...
Error parsing: http://test.site.com/en_US/index.2011.html: failed(2,200):
org.apache.nutch.parse.ParseException: Unable to successfully parse content

As soon as I turn it off, the pages are properly parsed.

Any thoughts?  Any other way to enable trace logging for parsing?