You are viewing a plain text version of this content. The canonical link for it is here.
Posted to j-users@xerces.apache.org by Jochen Wiedmann <jo...@gmail.com> on 2006/02/28 11:54:29 UTC
Lossless parsing
Hi,
I have a case where I want to apply slight changes to a document. The
most part of the document should be left unchanged, though. By
"unchanged" I mean *really* unchanged: In particular the documents
syntactical representation must not be changed.
A particular difficult part seems to be the documents internal subset.
Of course, I can attach a DeclHandler and all that stuff. However, the
internal subsets formatting (in particular white space characters) is
lost. (See
https://issues.apache.org/jira/browse/XERCESJ-1144
on that part.) An obvious idea would be to use the SAX parsers
http://xml.org/sax/properties/xml-string
property, but that is not supported by Xerces.
Am I lost?
Jochen
--
Whenever you find yourself on the side of the
majority, it is time to pause and reflect.
(Mark Twain)
---------------------------------------------------------------------
To unsubscribe, e-mail: j-users-unsubscribe@xerces.apache.org
For additional commands, e-mail: j-users-help@xerces.apache.org
Re: Lossless parsing
Posted by Joseph Kesselman <ke...@us.ibm.com>.
>I have a case where I want to apply slight changes to a document. The
>most part of the document should be left unchanged, though. By
>"unchanged" I mean *really* unchanged: In particular the documents
>syntactical representation must not be changed.
Process it as text?
Seriously, if that's the level you're concerned with, I don't know of any
XML tools that will suit yoru needs.
______________________________________
Joe Kesselman -- Beware of Blueshift!
"The world changed profoundly and unpredictably the day Tim Berners Lee
got bitten by a radioactive spider." -- Rafe Culpin, in r.m.filk
---------------------------------------------------------------------
To unsubscribe, e-mail: j-users-unsubscribe@xerces.apache.org
For additional commands, e-mail: j-users-help@xerces.apache.org