You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Vertical Search <ve...@gmail.com> on 2006/03/10 02:09:47 UTC

org.apache.nutch.net.URLFilter not found.

I am trying to crawl a site with Query string  containing "?" and "=".
So, I have modified the following line in crawl-urlfilter and
regex-urlfilter as per the advise in one of the posting on archieve

# skip URLs containing certain characters as probable queries, etc.
+[?*!@=]

But still getting the exception..
Is there something am missing ?

Can some one please assist...

Thanks
Sudhi

Re: org.apache.nutch.net.URLFilter not found.

Posted by Marko Bauhardt <mb...@media-style.com>.
Sounds like nutch does not found your plugins. A stacktrace from your  
exception could help.
Please verify your nutch-default.xml at the property

<property>
   <name>plugin.folders</name>
   <value>plugin</value>
   <description>Directories where nutch plugins are located.  Each
   element may be a relative or absolute path.  If absolute, it is used
   as is.  If relative, it is searched for on the classpath.</ 
description>
</property>


Marko