Posted to dev@nutch.apache.org by "byron miller (JIRA)" <ji...@apache.org> on 2006/01/29 03:05:35 UTC

[jira] Commented: (NUTCH-16) boost documents matching a url pattern

    [ http://issues.apache.org/jira/browse/NUTCH-16?page=comments#action_12364354 ] 

byron miller commented on NUTCH-16:
-----------------------------------

Cool

An inverse of this plugin would be great, or an enhancement of it that supports +/- values based on patterns. I think it would be useful to lower the score of domains like i.like.to.spam.with.keywords.in.my.url.pretending.im.a.good.site.dot.com
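
To make the +/- idea concrete, here is a rough sketch of what I have in mind. It is not the attached patch and does not use any Nutch API: the class UrlPatternBoost, its add/adjust methods, the intranet.example.com host, and the "six or more host labels" spam heuristic are all made up for illustration. Each rule pairs a URL regex with a multiplier, where a factor above 1 boosts the score and a factor between 0 and 1 lowers it; the first matching rule wins.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper, NOT part of Nutch or of the attached patch.
final class UrlPatternBoost {

    // One rule: a URL regex plus the factor applied to the score on a match.
    private static final class Rule {
        final Pattern pattern;
        final float factor;

        Rule(Pattern pattern, float factor) {
            this.pattern = pattern;
            this.factor = factor;
        }
    }

    private final List<Rule> rules = new ArrayList<>();

    // factor > 1 boosts, 0 < factor < 1 lowers the score.
    UrlPatternBoost add(String regex, float factor) {
        if (factor <= 0f) {
            throw new IllegalArgumentException("factor must be positive");
        }
        rules.add(new Rule(Pattern.compile(regex), factor));
        return this;
    }

    // Applies the first matching rule; URLs matching no rule keep their score.
    float adjust(String url, float score) {
        for (Rule rule : rules) {
            if (rule.pattern.matcher(url).find()) {
                return score * rule.factor;
            }
        }
        return score;
    }

    public static void main(String[] args) {
        UrlPatternBoost boost = new UrlPatternBoost()
            // Assumed intranet host, boosted.
            .add("^https?://intranet\\.example\\.com/", 2.0f)
            // Assumed heuristic: hosts with six or more labels are lowered.
            .add("^https?://([^/.]+\\.){5,}[^/.]+(/|$)", 0.5f);

        System.out.println(boost.adjust("http://intranet.example.com/docs", 1.0f));
        System.out.println(boost.adjust(
            "http://i.like.to.spam.with.keywords.in.my.url.pretending.im.a.good.site.dot.com/", 1.0f));
        System.out.println(boost.adjust("http://www.apache.org/", 1.0f));
    }
}
```

Using a multiplier instead of an additive +/- value keeps the score positive no matter how many rules are configured, but that is only one possible design; the plugin may well want something different.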

> boost documents matching a url pattern
> --------------------------------------
>
>          Key: NUTCH-16
>          URL: http://issues.apache.org/jira/browse/NUTCH-16
>      Project: Nutch
>         Type: New Feature
>   Components: indexer
>     Reporter: Stefan Groschupf
>     Priority: Trivial
>  Attachments: boost-url-src_and_bin.zip, boostingPluginPatch.txt
>
> The attached patch is a plugin that boosts documents matching a URL pattern.
> This could be useful to rank documents from an intranet higher than external pages.
> A README comes with the patch.
> Any comments are welcome.
