You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/11/12 14:59:12 UTC

[jira] [Resolved] (NUTCH-1451) Upgrade automaton jar to 1.11-8

     [ https://issues.apache.org/jira/browse/NUTCH-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lewis John McGibbney resolved NUTCH-1451.
-----------------------------------------

    Resolution: Fixed

Committed @revision 1408282 in trunk
Committed @revision 1408289 in 2.2-SNAPSHOT

I didn't upload patches for these fixes as the generated patches contained loads of non-Utf8 characters which corrupted the file. 
The fixes remove our dependency upon shipping with the automaton.jar and licenses. The automaton deps are now pulled by ivy. 
                
> Upgrade automaton jar to 1.11-8
> -------------------------------
>
>                 Key: NUTCH-1451
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1451
>             Project: Nutch
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.6, 2.1
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.6, 2.2
>
>
> The latest version 1.11-8 was released September 7, 2011.
> This library is significantly faster than the default regex parsing. I haven't got a clue what version we currently use but the license states 2005 so I'm guessing its been a long time since it was upgraded.
> I'll get a patch together and for completeness run independent test to compare results pre and post upgrade. It would be nice to see > marginal improvements :0)  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira