Posted to user@nutch.apache.org by Sunnyvale Fl <su...@gmail.com> on 2006/05/16 01:39:03 UTC
protocol redirect for nutch 0.7.2
Hi,
Is there an easy way to change how Nutch 0.7.2 handles protocol redirects?
Currently, I believe that if a site www.foo.com returns a 30x redirect to
www.bar.com, Nutch indexes the content under foo instead of bar. I read that
this is fixed in version 0.8. Is there a fix for 0.7.2, or can someone
suggest one? Thanks!
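
To make the behaviour I am after concrete, here is a minimal sketch in Python.
It is not Nutch code and does not use any Nutch API: the redirect table, the
page store, and the names resolve() and index() are all hypothetical stand-ins
for real 30x responses and the real indexer. The idea is only that the content
ends up keyed by the URL at the end of the redirect chain rather than by the
URL that was originally requested.

```python
from typing import Dict

# Hypothetical data: a redirect table standing in for HTTP 30x responses,
# and a page store standing in for fetched content.
REDIRECTS: Dict[str, str] = {"http://www.foo.com/": "http://www.bar.com/"}
PAGES: Dict[str, str] = {"http://www.bar.com/": "content served by bar"}


def resolve(url: str, redirects: Dict[str, str], max_hops: int = 5) -> str:
    """Follow redirects from url and return the final target URL."""
    hops = 0
    seen = {url}
    while url in redirects:
        if hops == max_hops:
            raise ValueError("too many redirects")
        url = redirects[url]
        if url in seen:
            raise ValueError("redirect loop")
        seen.add(url)
        hops += 1
    return url


def index(requested: str, redirects: Dict[str, str],
          pages: Dict[str, str]) -> Dict[str, str]:
    """Return an index entry keyed by the final URL, not the requested one."""
    final_url = resolve(requested, redirects)
    return {final_url: pages[final_url]}


if __name__ == "__main__":
    print(index("http://www.foo.com/", REDIRECTS, PAGES))
```

With the sample data above, the entry is stored under http://www.bar.com/, which
is what I would like 0.7.2 to do. The hop limit and loop check are my own
assumptions about what a safe fix would need.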