You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/03/29 00:51:15 UTC
[jira] [Created] (NUTCH-1550) xercesImpl and xmlParserAPIs
(org.apache.xml) packages and classes only used in three Nutch classes
Lewis John McGibbney created NUTCH-1550:
-------------------------------------------
Summary: xercesImpl and xmlParserAPIs (org.apache.xml) packages and classes only used in three Nutch classes
Key: NUTCH-1550
URL: https://issues.apache.org/jira/browse/NUTCH-1550
Project: Nutch
Issue Type: Improvement
Components: build, parser
Affects Versions: 2.1, 1.6
Reporter: Lewis John McGibbney
Assignee: Lewis John McGibbney
Priority: Minor
Fix For: 1.7, 2.2
DOMSerializerImpl from xerces is deprecated in our current artifact. It is replaced by the (still ancient but slightly newer org.apache.xml.serializer.dom3.LSSerializerImpl in [0]).
Upon closer inspection it seems that find . | xargs grep "org.apache.xml" * only pulled up DOMBuilder, XMLChatacterRecognizer and DOMContentUtilsTest as the places where such classes are used.
I am confused as to why they are included as primary dependencies within Nutch. Either these XML specific dependencies should be restricted dependencies to parse-html or else they should be removed and replaced by the new artifact [0].
[0] http://search.maven.org/#artifactdetails|xalan|serializer|2.7.1|jar
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira