Posted to dev@nutch.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/03/29 01:27:15 UTC

[jira] [Resolved] (NUTCH-1550) xercesImpl and xmlParserAPIs (org.apache.xml) packages and classes only used in three Nutch classes

     [ https://issues.apache.org/jira/browse/NUTCH-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lewis John McGibbney resolved NUTCH-1550.
-----------------------------------------

    Resolution: Invalid

This is utter garbage. Xerces uses many org.w3c imports which we pull from these dependencies. I'm closing as garbage.
                
> xercesImpl and xmlParserAPIs (org.apache.xml) packages and classes only used in three Nutch classes
> ---------------------------------------------------------------------------------------------------
>
>                 Key: NUTCH-1550
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1550
>             Project: Nutch
>          Issue Type: Improvement
>          Components: build, parser
>    Affects Versions: 1.6, 2.1
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.7, 2.2
>
>
> DOMSerializerImpl from xerces is deprecated in our current artifact. It is replaced by the (still ancient but slightly newer) org.apache.xml.serializer.dom3.LSSerializerImpl in [0].
> Upon closer inspection, it seems that find . | xargs grep "org.apache.xml" * only turned up DOMBuilder, XMLCharacterRecognizer and DOMContentUtilsTest as the places where such classes are used.
> I am confused as to why they are included as primary dependencies of Nutch. Either these XML-specific dependencies should be restricted to parse-html, or they should be removed and replaced by the new artifact [0].
> [0] http://search.maven.org/#artifactdetails|xalan|serializer|2.7.1|jar
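For reference, a minimal sketch of serializing a DOM tree through the standard DOM Level 3 Load and Save interface (org.w3c.dom.ls.LSSerializer), which is the kind of API the description proposes moving to. This is not Nutch code: the class name LsSerializerSketch, the serialize helper and the sample markup are hypothetical, and the sketch assumes whatever LSSerializer implementation the JDK's default parser provides rather than the xalan serializer artifact in [0].

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.ls.DOMImplementationLS;
import org.w3c.dom.ls.LSSerializer;
import org.xml.sax.InputSource;

// Hypothetical illustration class; not part of Nutch.
public class LsSerializerSketch {

  // Serialize a DOM document to a String via the DOM Level 3 LS API.
  static String serialize(Document doc) {
    // Ask the DOM implementation for its Load and Save feature.
    Object feature = doc.getImplementation().getFeature("LS", "3.0");
    if (!(feature instanceof DOMImplementationLS)) {
      throw new IllegalStateException("DOM implementation lacks LS 3.0 support");
    }
    LSSerializer serializer = ((DOMImplementationLS) feature).createLSSerializer();
    // Leave out the <?xml ...?> declaration so only the markup is returned.
    serializer.getDomConfig().setParameter("xml-declaration", Boolean.FALSE);
    return serializer.writeToString(doc);
  }

  public static void main(String[] args) throws Exception {
    // Sample input, assumed for illustration only.
    String xml = "<html><body>hi</body></html>";
    Document doc = DocumentBuilderFactory.newInstance()
        .newDocumentBuilder()
        .parse(new InputSource(new StringReader(xml)));
    System.out.println(serialize(doc));
  }
}
```

Because the code depends only on the org.w3c.dom.ls interfaces, the concrete serializer class stays an implementation detail of whichever artifact ends up on the classpath.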

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira