Posted to user@nutch.apache.org by Scam <sc...@inbox.ru> on 2007/06/15 00:33:57 UTC
Re[2]: Any URL filter available for search.jsp?
Hello Andrzej,
Friday, June 15, 2007, 1:25, you wrote:
>> Thursday, May 31, 2007, 14:41, you wrote:
>>
>> MR> I want to use two filters one for crawling and another for searching
>> MR> through search.jsp.
>>
>> MR> I am currently using regex-urlfilter.txt for generate, fetch, update
>> MR> cycle. But when a user searches the sites, I want him not to see
>> MR> certain sites in the results that have been crawled.
>>
>> MR> How can this be achieved?
>>
>> Has anyone solved this problem? I need this filter too. What is the
>> best way to do it in Nutch 0.9?
>> Any thoughts?
>>
AB> http://issues.apache.org/jira/browse/NUTCH-477
I applied the patch to trunk and built Nutch, but the search results are still not filtered.
As a test, I put the line "-amazon" in the
WEB-INF/classes/regex-urlfilter.txt
file, but nothing changed: the search results still contain links
with the word "amazon" in them.
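For reference, this is how I understand the rule file to be evaluated, as a minimal sketch and not Nutch's actual code. Each line is a '+' or '-' sign followed by a regex, the rules are tried from top to bottom, and the first one that matches anywhere in the URL decides. The class name UrlFilterSketch is made up for this illustration, and I am only assuming that the search-time filter reads the file the same way as the crawl-time one.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical illustration of first-match-wins rule evaluation.
// This is NOT Nutch's RegexURLFilter, only a sketch of the idea.
public class UrlFilterSketch {

    // One parsed rule: the sign ('+' accept, '-' reject) and its regex.
    private static final class Rule {
        final boolean accept;
        final Pattern pattern;

        Rule(boolean accept, Pattern pattern) {
            this.accept = accept;
            this.pattern = pattern;
        }
    }

    private final List<Rule> rules = new ArrayList<Rule>();

    // Each argument is one line of the rule file, in file order.
    public UrlFilterSketch(String... lines) {
        for (String line : lines) {
            String trimmed = line.trim();
            if (trimmed.isEmpty() || trimmed.startsWith("#")) {
                continue; // skip blank lines and comments
            }
            char sign = trimmed.charAt(0);
            if (sign != '+' && sign != '-') {
                throw new IllegalArgumentException("Bad rule: " + line);
            }
            rules.add(new Rule(sign == '+', Pattern.compile(trimmed.substring(1))));
        }
    }

    // The first rule whose regex is found in the URL decides the outcome.
    // A URL that matches no rule is rejected.
    public boolean accepts(String url) {
        for (Rule rule : rules) {
            if (rule.pattern.matcher(url).find()) {
                return rule.accept;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        UrlFilterSketch filter = new UrlFilterSketch("-amazon", "+.");
        System.out.println(filter.accepts("http://www.amazon.com/books"));     // false
        System.out.println(filter.accepts("http://lucene.apache.org/nutch/")); // true
    }
}
```

If that reading is right, the position of the line matters: with "+." listed before "-amazon", the "-amazon" rule would never be reached.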
Could you tell me what else I need to do to make it work?
--
Best regards,
Scam mailto:scam@inbox.ru