Posted to c-dev@xerces.apache.org by Alberto Massari <am...@datadirect.com> on 2005/04/01 10:19:39 UTC

Re: Special char within an XML ï

At 08.59 30/03/2005 -0500, Orly Caduri wrote:
>Hi,
>
>I need to parse the following XML:
><?xml version=\"1.0\" encoding=\"UTF-8\"?><ROOT><ROOT1>&lt;?xml 
>version=&quot;1.0&quot;?&gt;&lt;story&gt;&lt;TGM&gt; &#128; this &#239; is 
>a message&lt;/TGM&gt;&lt;/story&gt;</ROOT1></ROOT>
>
>I first load the string into a DOM and extract the value of the <ROOT1> 
>tag. Then I load the returned string into a second DOM and parse it as 
>well. Everything works fine, except that when I try to get the TGM tag, 
>its value is corrupted at &#239;. The value &#128; works fine.
>
>I am right before a release and any help is greatly appreciated.
>
>I run Xerces-C++ 2.2.0 on Windows Professional.

Orly,
could you post the code you use to parse the extracted string? Are you by 
chance setting an encoding other than UTF-16?
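
To make the encoding question concrete, here is a minimal sketch of the same two-step parse. It uses Python's standard xml.etree.ElementTree rather than Xerces-C++, so it only illustrates the idea and is not a diagnosis of the code in question. The helper names (extract_inner, extract_tgm) are hypothetical, and the last step deliberately simulates one possible mismatch (UTF-8 bytes read as ISO-8859-1) to show how a character such as &#239; could end up garbled.

```python
import xml.etree.ElementTree as ET

# The document from the original message, with the wrapped lines joined.
OUTER = (
    '<?xml version="1.0" encoding="UTF-8"?><ROOT><ROOT1>&lt;?xml '
    'version=&quot;1.0&quot;?&gt;&lt;story&gt;&lt;TGM&gt; &#128; this &#239; is '
    'a message&lt;/TGM&gt;&lt;/story&gt;</ROOT1></ROOT>'
).encode("ascii")


def extract_inner(outer_bytes):
    """Step 1: parse the outer document and return the text of <ROOT1>."""
    root = ET.fromstring(outer_bytes)
    return root.find("ROOT1").text


def extract_tgm(inner_doc):
    """Step 2: parse the extracted string and return the text of <TGM>."""
    return ET.fromstring(inner_doc).find("TGM").text


inner_text = extract_inner(OUTER)
tgm = extract_tgm(inner_text)

# Both character references survive when the string is handled as text:
# &#128; becomes U+0080 and &#239; becomes U+00EF.
print(repr(tgm))

# The same character has a different byte form in each encoding.
for name in ("utf-8", "utf-16-le", "latin-1"):
    print(name, "\u00ef".encode(name).hex())

# Simulated mismatch (an assumption, not the reported bug): the inner
# document is serialized as UTF-8 but then read back as ISO-8859-1.
misread = inner_text.encode("utf-8").decode("latin-1")
garbled = extract_tgm(misread)
print(repr(garbled))
```

If the second parse is given bytes, the encoding it is told to expect has to match how those bytes were actually produced; keeping the extracted value as a wide-character string throughout avoids that conversion entirely.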

Alberto


