Posted to c-dev@xerces.apache.org by Alberto Massari <am...@datadirect.com> on 2005/04/01 10:19:39 UTC
Re: Special char within an XML ï
At 08.59 30/03/2005 -0500, Orly Caduri wrote:
>Hi,
>
>I need to parse the following XML:
><?xml version=\"1.0\" encoding=\"UTF-8\"?><ROOT><ROOT1><?xml
>version="1.0"?><story><TGM> € this ï is
>a message</TGM></story></ROOT1></ROOT>
>
>I first place the string in a DOM and then extract the value of the tag
><ROOT1>. Then I place the returned string into a DOM and parse it as well.
>Everything works fine, except that when I try to get the TGM tag its value
>is corrupted at the character ï. The character € works fine.
>
>I am right before a release and any help is greatly appreciated.
>
>I run Xerces-C++ 2.2.0 on Windows Professional.

Orly,
can you post the code you use to parse the extracted string? Are you by
chance setting an encoding different from UTF-16?
Alberto
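
As background to the encoding question, here is a minimal sketch in plain C++ (not Xerces code and not taken from this thread) of why the byte encoding of an extracted string matters. The helper names toUtf8 and toLatin1 are hypothetical, introduced only for illustration. It shows that the character ï (U+00EF) is one byte in ISO-8859-1 but two bytes in UTF-8, so a buffer written in one encoding and parsed under another label would not round-trip. Whether this is what happens in the code above cannot be told without seeing it.

```cpp
#include <cstdint>
#include <string>

// Hypothetical helper: encode one Unicode code point as UTF-8 bytes.
// Assumes cp is a valid scalar value (no surrogate checking, for brevity).
std::string toUtf8(std::uint32_t cp) {
    std::string out;
    if (cp < 0x80) {
        // 1 byte: plain ASCII
        out += static_cast<char>(cp);
    } else if (cp < 0x800) {
        // 2 bytes: 110xxxxx 10xxxxxx
        out += static_cast<char>(0xC0 | (cp >> 6));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else if (cp < 0x10000) {
        // 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        out += static_cast<char>(0xE0 | (cp >> 12));
        out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else {
        // 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
        out += static_cast<char>(0xF0 | (cp >> 18));
        out += static_cast<char>(0x80 | ((cp >> 12) & 0x3F));
        out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    }
    return out;
}

// Hypothetical helper: encode a code point as ISO-8859-1 (Latin-1).
// Only meaningful for cp <= 0xFF; each character is exactly one byte.
std::string toLatin1(std::uint32_t cp) {
    return std::string(1, static_cast<char>(cp & 0xFF));
}
```

For example, toUtf8(0xEF) yields the two bytes C3 AF, while toLatin1(0xEF) yields the single byte EF, which is not a valid sequence on its own when read as UTF-8.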