You are viewing a plain text version of this content. The canonical link for it is here.
Posted to j-dev@xerces.apache.org by "Arnold, Curt" <Cu...@hyprotech.com> on 2000/06/28 20:10:06 UTC
RE: SAX2 fatal error in Xerces-J-1.1.1
Sounds like you should use the Tidy program from W3C (http://www.w3.org/People/Raggett/tidy/) to clean up your HTML to XHTML before processing. Definitely, it is outside the scope of an XML parser to
do HTML parsing. If that doesn't do what you need then I would either grab its source or go looking for an HTML parser.