You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/11/12 14:59:12 UTC
[jira] [Resolved] (NUTCH-1451) Upgrade automaton jar to 1.11-8
[ https://issues.apache.org/jira/browse/NUTCH-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lewis John McGibbney resolved NUTCH-1451.
-----------------------------------------
Resolution: Fixed
Committed @revision 1408282 in trunk
Committed @revision 1408289 in 2.2-SNAPSHOT
I didn't upload patches for these fixes as the generated patches contained loads of non-Utf8 characters which corrupted the file.
The fixes remove our dependency upon shipping with the automaton.jar and licenses. The automaton deps are now pulled by ivy.
> Upgrade automaton jar to 1.11-8
> -------------------------------
>
> Key: NUTCH-1451
> URL: https://issues.apache.org/jira/browse/NUTCH-1451
> Project: Nutch
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.6, 2.1
> Reporter: Lewis John McGibbney
> Assignee: Lewis John McGibbney
> Priority: Minor
> Fix For: 1.6, 2.2
>
>
> The latest version 1.11-8 was released September 7, 2011.
> This library is significantly faster than the default regex parsing. I haven't got a clue what version we currently use but the license states 2005 so I'm guessing its been a long time since it was upgraded.
> I'll get a patch together and for completeness run independent test to compare results pre and post upgrade. It would be nice to see > marginal improvements :0)
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira