You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Ronny <ro...@metzgerei-lebek.de> on 2006/07/25 15:33:06 UTC
Please Help - Patch not working - external links still crawled
Hi all,
after installing the patch http://issues.apache.org/jira/browse/NUTCH-173 and a whole-web crawl external links will still be crawled.
I modified the nutch-site.xml as follows:
<property>
<name>crawl.ignore.external.links</name>
<value>true</value>
<description>not crwling external links</description>
</property>
What made I wrong?
Thanks for any help
Kind regards
Ronny
Re: Please Help - Patch not working - external links still crawled
Posted by Ronny <ro...@metzgerei-lebek.de>.
Hi Stefan,
I didnĀ“t do that.... What I have to do now? Rebuild with ant? Please can you
tell me how to do that?
As I am said before I am very new to nutch and Java.
Kind regards and many thanks
Ronny
----- Original Message -----
From: "Stefan Neufeind" <ap...@stefan-neufeind.de>
To: <nu...@lucene.apache.org>
Sent: Tuesday, July 25, 2006 4:23 PM
Subject: Re: Please Help - Patch not working - external links still crawled
> Ronny wrote:
>> Hi all,
>>
>> after installing the patch http://issues.apache.org/jira/browse/NUTCH-173
>> and a whole-web crawl external links will still be crawled.
>>
>> I modified the nutch-site.xml as follows:
>>
>> <property>
>>
>> <name>crawl.ignore.external.links</name>
>>
>> <value>true</value>
>>
>> <description>not crwling external links</description>
>>
>> </property>
>>
>> What made I wrong?
>
> You did not rebuild nutch, did you?
>
>
> Regards,
> Stefan
>
Re: Please Help - Patch not working - external links still crawled
Posted by Stefan Neufeind <ap...@stefan-neufeind.de>.
Ronny wrote:
> Hi all,
>
> after installing the patch http://issues.apache.org/jira/browse/NUTCH-173 and a whole-web crawl external links will still be crawled.
>
> I modified the nutch-site.xml as follows:
>
> <property>
>
> <name>crawl.ignore.external.links</name>
>
> <value>true</value>
>
> <description>not crwling external links</description>
>
> </property>
>
> What made I wrong?
You did not rebuild nutch, did you?
Regards,
Stefan