You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Ronny <ro...@metzgerei-lebek.de> on 2006/07/25 15:33:06 UTC

Please Help - Patch not working - external links still crawled

Hi all,

after installing the patch http://issues.apache.org/jira/browse/NUTCH-173 and a whole-web crawl external links will still be crawled.

 I modified the nutch-site.xml as follows:

<property>

<name>crawl.ignore.external.links</name>

<value>true</value>

<description>not crwling external links</description>

</property>

What made I wrong?

Thanks for any help

Kind regards

Ronny

Re: Please Help - Patch not working - external links still crawled

Posted by Ronny <ro...@metzgerei-lebek.de>.
Hi Stefan,

I didnĀ“t do that.... What I have to do now? Rebuild with ant? Please can you 
tell me how to do that?

As I am said before I am very new to nutch and Java.

Kind regards and many thanks
Ronny


----- Original Message ----- 
From: "Stefan Neufeind" <ap...@stefan-neufeind.de>
To: <nu...@lucene.apache.org>
Sent: Tuesday, July 25, 2006 4:23 PM
Subject: Re: Please Help - Patch not working - external links still crawled


> Ronny wrote:
>> Hi all,
>>
>> after installing the patch http://issues.apache.org/jira/browse/NUTCH-173 
>> and a whole-web crawl external links will still be crawled.
>>
>>  I modified the nutch-site.xml as follows:
>>
>> <property>
>>
>> <name>crawl.ignore.external.links</name>
>>
>> <value>true</value>
>>
>> <description>not crwling external links</description>
>>
>> </property>
>>
>> What made I wrong?
>
> You did not rebuild nutch, did you?
>
>
> Regards,
> Stefan
> 



Re: Please Help - Patch not working - external links still crawled

Posted by Stefan Neufeind <ap...@stefan-neufeind.de>.
Ronny wrote:
> Hi all,
> 
> after installing the patch http://issues.apache.org/jira/browse/NUTCH-173 and a whole-web crawl external links will still be crawled.
> 
>  I modified the nutch-site.xml as follows:
> 
> <property>
> 
> <name>crawl.ignore.external.links</name>
> 
> <value>true</value>
> 
> <description>not crwling external links</description>
> 
> </property>
> 
> What made I wrong?

You did not rebuild nutch, did you?


Regards,
 Stefan