You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/03/29 00:51:15 UTC

[jira] [Created] (NUTCH-1550) xercesImpl and xmlParserAPIs (org.apache.xml) packages and classes only used in three Nutch classes

Lewis John McGibbney created NUTCH-1550:
-------------------------------------------

             Summary: xercesImpl and xmlParserAPIs (org.apache.xml) packages and classes only used in three Nutch classes
                 Key: NUTCH-1550
                 URL: https://issues.apache.org/jira/browse/NUTCH-1550
             Project: Nutch
          Issue Type: Improvement
          Components: build, parser
    Affects Versions: 2.1, 1.6
            Reporter: Lewis John McGibbney
            Assignee: Lewis John McGibbney
            Priority: Minor
             Fix For: 1.7, 2.2


DOMSerializerImpl from xerces is deprecated in our current artifact. It is replaced by the (still ancient but slightly newer org.apache.xml.serializer.dom3.LSSerializerImpl in [0]). 
Upon closer inspection it seems that find . | xargs grep "org.apache.xml" * only pulled up DOMBuilder, XMLChatacterRecognizer and DOMContentUtilsTest as the places where such classes are used.
I am confused as to why they are included as primary dependencies within Nutch. Either these XML specific dependencies should be restricted dependencies to parse-html or else they should be removed and replaced by the new artifact [0].  
[0] http://search.maven.org/#artifactdetails|xalan|serializer|2.7.1|jar

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira