Posted to dev@nutch.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/03/29 01:27:15 UTC
[jira] [Resolved] (NUTCH-1550) xercesImpl and xmlParserAPIs
(org.apache.xml) packages and classes only used in three Nutch classes
[ https://issues.apache.org/jira/browse/NUTCH-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lewis John McGibbney resolved NUTCH-1550.
-----------------------------------------
Resolution: Invalid
This is utter garbage. Xerces uses many org.w3c imports which we pull from these dependencies. I'm closing as garbage.
> xercesImpl and xmlParserAPIs (org.apache.xml) packages and classes only used in three Nutch classes
> ---------------------------------------------------------------------------------------------------
>
> Key: NUTCH-1550
> URL: https://issues.apache.org/jira/browse/NUTCH-1550
> Project: Nutch
> Issue Type: Improvement
> Components: build, parser
> Affects Versions: 1.6, 2.1
> Reporter: Lewis John McGibbney
> Assignee: Lewis John McGibbney
> Priority: Minor
> Fix For: 1.7, 2.2
>
>
> DOMSerializerImpl from xerces is deprecated in our current artifact. It is replaced by the (still ancient but slightly newer) org.apache.xml.serializer.dom3.LSSerializerImpl in [0].
> Upon closer inspection it seems that find . | xargs grep "org.apache.xml" * only pulled up DOMBuilder, XMLCharacterRecognizer and DOMContentUtilsTest as the places where such classes are used.
> I am confused as to why they are included as primary dependencies within Nutch. Either these XML-specific dependencies should be restricted to parse-html, or else they should be removed and replaced by the new artifact [0].
> [0] http://search.maven.org/#artifactdetails|xalan|serializer|2.7.1|jar