You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Scam <sc...@inbox.ru> on 2007/06/15 00:33:57 UTC

Re[2]: Any URL filter available for search.jsp?

Hello Andrzej,

Friday, June 15, 2007, 1:25, you wrote:

>> Thursday, May 31, 2007, 14:41, you wrote:
>> 
>> MR> I want to use two filters one for crawling and another for searching
>> MR> through search.jsp.
>> 
>> MR> I am currently using regex-urlfilter.txt for generate, fetch, update
>> MR> cycle. But when a user searches the sites, I want him not to see
>> MR> certain sites in the results that have been crawled.
>> 
>> MR> How can this be achieved?
>> 
>> Anyone solve this problem? I need this filter too. How to do it in the
>> best way in nutch 0.9?
>> Any thoughts?
>> 

AB> http://issues.apache.org/jira/browse/NUTCH-477

I applied the patch to trunk and build nutch but search results are not filtered still.

I placed string "-amazon" for example in the
WEB-INF/classes/regex-urlfilter.txt
file. But nothing changed. Results are not filtered (search results
contains links with "amazon" word).

Could you give me instruction what to do to make it workable?


-- 
Best regards,
 Scam                            mailto:scam@inbox.ru