You are viewing a plain text version of this content. The canonical link for it is here.
Posted to p-dev@xerces.apache.org by Gero Meißner <me...@zentrale.heinze.de> on 2001/07/25 13:05:38 UTC

substitution of utf-8 sequences

When I try the following...

my $file = 'source.xml';
my $parser = new XML::Xerces::DOMParser();
$parser->parse(XML::Xerces::LocalFileInputSource->new($file));

.. all unicode-references (ex.: &#228;) are replaced.

So printing the DOM tree to a file with XML::Xerces::DOMParse::print creates
a new document with errors because of the missing unicodes.

How can I parse and manipulate a document without all unicode-references
being replaced? They have to be the same in the source-file and in the
created file.

Gero Meißner

Heinze GmbH
Geschäftsstelle Ost
Paradiesstraße 206B
12526 Berlin

Tel: +49 (030) 67 97 09 24
Fax: +49 (030) 67 97 09 34
email: gero.meissner@heinze.de


---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-p-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-p-dev-help@xml.apache.org