You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Vertical Search <ve...@gmail.com> on 2006/03/10 02:09:47 UTC
org.apache.nutch.net.URLFilter not found.
I am trying to crawl a site with Query string containing "?" and "=".
So, I have modified the following line in crawl-urlfilter and
regex-urlfilter as per the advise in one of the posting on archieve
# skip URLs containing certain characters as probable queries, etc.
+[?*!@=]
But still getting the exception..
Is there something am missing ?
Can some one please assist...
Thanks
Sudhi
Re: org.apache.nutch.net.URLFilter not found.
Posted by Marko Bauhardt <mb...@media-style.com>.
Sounds like nutch does not found your plugins. A stacktrace from your
exception could help.
Please verify your nutch-default.xml at the property
<property>
<name>plugin.folders</name>
<value>plugin</value>
<description>Directories where nutch plugins are located. Each
element may be a relative or absolute path. If absolute, it is used
as is. If relative, it is searched for on the classpath.</
description>
</property>
Marko