You are viewing a plain text version of this content. The canonical link for it is here.

Posted to c-dev@xerces.apache.org by bu...@apache.org on 2003/12/01 17:19:47 UTC

DO NOT REPLY [Bug 21556] - UTF-8 Data getting corrupted after Parsing

DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=21556>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=21556

UTF-8 Data getting corrupted after Parsing

alby@exceloncorp.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |INVALID



------- Additional Comments From alby@exceloncorp.com  2003-12-01 16:19 -------
You are using a class, CGGString, to perform the serialization of the node, 
but we cannot see how this class works. However, I see that you are 
serializing an XML header containing the encoding="shift-jis" declaration, so 
I guess that class will generate SHIFT-JIS-encoded character.
>From what I know, UTF-8 and SHIFT-JIS have different ways of encoding the same 
Japanese character, so it's expected for the output file to be different from 
the input file.

If you wish to reopen the bug, please include the source for that CGString 
class.

Alberto

---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-c-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-c-dev-help@xml.apache.org