Posted to j-dev@xerces.apache.org by "Arnold, Curt" <Cu...@hyprotech.com> on 2000/06/28 20:10:06 UTC

RE: SAX2 fatal error in Xerces-J-1.1.1

It sounds like you should run your HTML through the W3C's Tidy program (http://www.w3.org/People/Raggett/tidy/) to convert it to XHTML before processing.  Parsing HTML is definitely outside the scope of an XML parser.
If Tidy doesn't do what you need, then I would either grab Tidy's source or go looking for a dedicated HTML parser.
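
To illustrate why the cleanup step matters, here is a rough sketch of how a SAX parser reacts to tag-soup HTML versus the same markup in XHTML form. It goes through the standard JAXP SAXParserFactory rather than the Xerces classes directly, and the class name, method name, and sample strings are made up for illustration; treat it as an assumption-laden example, not as what the original poster ran.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper class, not part of Xerces or Tidy.
public class WellFormedCheck {

    // Returns true if the markup parses as well-formed XML,
    // false if the parser reports a SAX error for it.
    public static boolean isWellFormed(String markup) {
        SAXParser parser;
        try {
            parser = SAXParserFactory.newInstance().newSAXParser();
        } catch (ParserConfigurationException | SAXException e) {
            throw new IllegalStateException("could not create a SAX parser", e);
        }
        try {
            // DefaultHandler rethrows fatal errors as SAXParseException.
            parser.parse(new InputSource(new StringReader(markup)), new DefaultHandler());
            return true;
        } catch (SAXException e) {
            return false;
        } catch (IOException e) {
            throw new IllegalStateException("unexpected I/O failure on in-memory input", e);
        }
    }

    public static void main(String[] args) {
        // Typical HTML: unclosed <br> and <p> elements.
        String html = "<html><body><p>one<br><p>two</body></html>";
        // The same content with every element closed, as Tidy-style cleanup would aim for.
        String xhtml = "<html><body><p>one<br/></p><p>two</p></body></html>";

        System.out.println("HTML well-formed?  " + isWellFormed(html));
        System.out.println("XHTML well-formed? " + isWellFormed(xhtml));
    }
}
```

The first string is rejected because the parser reaches </body> while <p> is still open, which is a well-formedness violation and therefore a fatal error for any conforming XML parser. The second string parses cleanly.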