You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/30 15:32:34 UTC

[jira] [Reopened] (NUTCH-1441) AnchorIndexingFilter should use plain HashSet

     [ https://issues.apache.org/jira/browse/NUTCH-1441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ferdy Galema reopened NUTCH-1441:
---------------------------------

    
> AnchorIndexingFilter should use plain HashSet
> ---------------------------------------------
>
>                 Key: NUTCH-1441
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1441
>             Project: Nutch
>          Issue Type: Bug
>            Reporter: Ferdy Galema
>            Priority: Minor
>             Fix For: 1.6, 2.1
>
>         Attachments: NUTCH-1441.patch
>
>
> AnchorIndexingFilter should use a plain HashSet, instead of WeakHashMap. WeakHashMap is unnecessary and can perhaps even cause bugs. (A WeakHashMap get its entries removed when the gc notices the keys are not elsewhere in use.)
> This patch also makes the filter a bit faster by lazy instantiating the set. (No need to create one everytime when deduplication is off).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira