Posted to j-users@xerces.apache.org by Jochen Wiedmann <jo...@gmail.com> on 2006/02/28 11:54:29 UTC

Lossless parsing

Hi,

I have a case where I want to apply slight changes to a document. Most
of the document should be left unchanged, though. By
"unchanged" I mean *really* unchanged: in particular, the document's
syntactic representation must not be changed.

A particularly difficult part seems to be the document's internal subset.
Of course, I can attach a DeclHandler and all that stuff. However, the
internal subset's formatting (in particular white space characters) is
lost. (See

    https://issues.apache.org/jira/browse/XERCESJ-1144

on that part.) An obvious idea would be to use the SAX parser's

    http://xml.org/sax/properties/xml-string

property, but that is not supported by Xerces.
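
To make the problem concrete, here is a minimal sketch of what I mean. It
attaches a DeclHandler through the standard declaration-handler property and,
from inside the callback, asks the reader for the xml-string property. The
class and member names (InternalSubsetProbe, Recorder, events,
xmlStringAvailable) are made up for illustration, and I am assuming the
default JAXP parser here rather than a specific Xerces build.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXNotRecognizedException;
import org.xml.sax.SAXNotSupportedException;
import org.xml.sax.XMLReader;
import org.xml.sax.ext.DeclHandler;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of Xerces or SAX.
final class InternalSubsetProbe {
    static final String DECL_HANDLER =
            "http://xml.org/sax/properties/declaration-handler";
    static final String XML_STRING =
            "http://xml.org/sax/properties/xml-string";

    /** Records declarations as reported; the callbacks carry no layout information. */
    static final class Recorder extends DefaultHandler implements DeclHandler {
        final List<String> events = new ArrayList<>();
        boolean xmlStringAvailable = false;
        private final XMLReader reader;

        Recorder(XMLReader reader) {
            this.reader = reader;
        }

        // xml-string is only meaningful during a parse, so probe it from a callback.
        private void probeXmlString() {
            try {
                if (reader.getProperty(XML_STRING) != null) {
                    xmlStringAvailable = true;
                }
            } catch (SAXNotRecognizedException | SAXNotSupportedException e) {
                // The parser does not offer the literal text of the current event.
            }
        }

        @Override
        public void elementDecl(String name, String model) {
            events.add("element " + name + " " + model);
            probeXmlString();
        }

        @Override
        public void attributeDecl(String eName, String aName, String type,
                                  String mode, String value) {
            events.add("attribute " + eName + " " + aName);
        }

        @Override
        public void internalEntityDecl(String name, String value) {
            events.add("entity " + name);
        }

        @Override
        public void externalEntityDecl(String name, String publicId, String systemId) {
            events.add("external entity " + name);
        }
    }

    /** Parses the given document text and returns the recorder with its findings. */
    static Recorder parse(String xml) {
        try {
            XMLReader reader =
                    SAXParserFactory.newInstance().newSAXParser().getXMLReader();
            Recorder recorder = new Recorder(reader);
            reader.setContentHandler(recorder);
            reader.setProperty(DECL_HANDLER, recorder);
            reader.parse(new InputSource(new StringReader(xml)));
            return recorder;
        } catch (Exception e) {
            throw new IllegalStateException("parse failed", e);
        }
    }

    public static void main(String[] args) {
        Recorder r = parse("<!DOCTYPE root [\n  <!ELEMENT   root   (#PCDATA)  >\n]>\n<root>x</root>");
        System.out.println(r.events);
        System.out.println("xml-string available: " + r.xmlStringAvailable);
    }
}
```

The handler only ever sees the element name and the content model as strings,
so the run of blanks inside the declaration has nowhere to go, and without
xml-string there is no other way to get at it.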

Am I lost?


Jochen


--
Whenever you find yourself on the side of the
majority, it is time to pause and reflect.
(Mark Twain)

---------------------------------------------------------------------
To unsubscribe, e-mail: j-users-unsubscribe@xerces.apache.org
For additional commands, e-mail: j-users-help@xerces.apache.org


Re: Lossless parsing

Posted by Joseph Kesselman <ke...@us.ibm.com>.
>I have a case where I want to apply slight changes to a document. Most
>of the document should be left unchanged, though. By
>"unchanged" I mean *really* unchanged: in particular, the document's
>syntactic representation must not be changed.

Process it as text?

Seriously, if that's the level you're concerned with, I don't know of any
XML tools that will suit your needs.
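
As a rough sketch of what "process it as text" could look like, assuming the
change can be located by a literal string: the TextualEdit class and its
method names below are hypothetical, and nothing here understands XML, so it
is up to you to make sure the target string is unambiguous. Decoding the bytes
as ISO-8859-1 is just a trick to map every byte to one char and back, so
untouched bytes survive whatever the document's real encoding is.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper: edits a document as text so untouched parts stay identical.
final class TextualEdit {

    /** Replaces the first literal occurrence of target; all other characters are copied verbatim. */
    static String replaceFirstLiteral(String document, String target, String replacement) {
        int at = document.indexOf(target);
        if (at < 0) {
            return document; // nothing to change
        }
        return document.substring(0, at)
                + replacement
                + document.substring(at + target.length());
    }

    /**
     * Byte-level variant. ISO-8859-1 maps each byte to exactly one char and back,
     * so bytes outside the replaced span round-trip unchanged. The target and
     * replacement are assumed to be plain ASCII here.
     */
    static byte[] replaceFirstLiteral(byte[] document, String target, String replacement) {
        String text = new String(document, StandardCharsets.ISO_8859_1);
        String edited = replaceFirstLiteral(text, target, replacement);
        return edited.getBytes(StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        String doc = "<!DOCTYPE a [\n  <!ENTITY   v   \"1.0\"  >\n]>\n<a  version = \"1.0\"/>";
        System.out.println(
                replaceFirstLiteral(doc, "version = \"1.0\"", "version = \"1.1\""));
    }
}
```

The internal subset, including its odd spacing, comes out exactly as it went
in, because it is never parsed in the first place.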

______________________________________
Joe Kesselman -- Beware of Blueshift!
"The world changed profoundly and unpredictably the day Tim Berners Lee
got bitten by a radioactive spider." -- Rafe Culpin, in r.m.filk

