You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/09/18 22:23:07 UTC

[jira] [Resolved] (NUTCH-1441) AnchorIndexingFilter should use plain HashSet

     [ https://issues.apache.org/jira/browse/NUTCH-1441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lewis John McGibbney resolved NUTCH-1441.
-----------------------------------------

    Resolution: Fixed

Committed @revision 1387341 in trunk
Thank you Ferdy
                
> AnchorIndexingFilter should use plain HashSet
> ---------------------------------------------
>
>                 Key: NUTCH-1441
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1441
>             Project: Nutch
>          Issue Type: Bug
>            Reporter: Ferdy Galema
>            Priority: Minor
>             Fix For: 1.6, 2.1
>
>         Attachments: NUTCH-1441.patch, NUTCH-1441-trunk.patch
>
>
> AnchorIndexingFilter should use a plain HashSet, instead of WeakHashMap. WeakHashMap is unnecessary and can perhaps even cause bugs. (A WeakHashMap get its entries removed when the gc notices the keys are not elsewhere in use.)
> This patch also makes the filter a bit faster by lazy instantiating the set. (No need to create one everytime when deduplication is off).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira