Posted to dev@xalan.apache.org by "Christopher Taylor (JIRA)" <ji...@apache.org> on 2012/08/27 14:11:07 UTC

[jira] [Comment Edited] (XALANJ-2560) ToXMLStream does not support unicode supplementary characters

    [ https://issues.apache.org/jira/browse/XALANJ-2560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13442369#comment-13442369 ] 

Christopher Taylor edited comment on XALANJ-2560 at 8/27/12 11:09 PM:
----------------------------------------------------------------------

Possible duplicate of XALANJ-2419; also referenced in http://stackoverflow.com/questions/10511474/surrogate-pair-handling-in-xalan-2-7-1
                
      was (Author: ccmtaylor):
    possible dupe of XALANJ-2419
                  
> ToXMLStream does not support unicode supplementary characters
> -------------------------------------------------------------
>
>                 Key: XALANJ-2560
>                 URL: https://issues.apache.org/jira/browse/XALANJ-2560
>             Project: XalanJ2
>          Issue Type: Bug
>      Security Level: No security risk; visible to anyone(Ordinary problems in Xalan projects.  Anybody can view the issue.) 
>          Components: Serialization
>    Affects Versions: 2.7.1
>         Environment: Xalan 2.7.1 serializer.
> Tested on Ubuntu 12.04 with Oracle JDK 1.7.0_05.
>            Reporter: Damien Guillaume
>              Labels: serialization, unicode
>
> org.apache.xml.serializer.ToXMLStream (which extends ToStream) does not support serialization of Unicode supplementary characters such as U+1D49C. It creates invalid character entities like "&#55349;&#56476;" instead of "&#119964;" (or the bytes F0 9D 92 9C in UTF-8). ToXMLStream is used by LSSerializer when Xalan's serializer is on the classpath.
> org.apache.xml.serialize.DOMSerializerImpl (included in Xerces) does not have this problem, but it has been deprecated since Xerces 2.9.0, so this is a regression.
> See http://stackoverflow.com/questions/11952289/serializing-supplementary-unicode-characters-into-xml-documents-with-java for more details.
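
To illustrate the difference described above, here is a minimal sketch in plain Java. It is not Xalan's code: the class and method names are hypothetical, and it only escapes non-ASCII characters (it does not handle "&" or "<"). Escaping each UTF-16 char separately yields the two surrogate references from the report, while iterating by code point yields the single expected reference.

```java
// Hypothetical illustration only; none of these names exist in Xalan or Xerces.
final class SupplementaryEscapeSketch {

    // Escapes each UTF-16 char on its own, so a surrogate pair becomes
    // two (invalid) numeric character references.
    static String escapePerChar(String text) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c < 0x80) {
                out.append(c);
            } else {
                out.append("&#").append((int) c).append(';');
            }
        }
        return out.toString();
    }

    // Iterates by code point, so a surrogate pair is combined into one
    // supplementary code point before it is written as a reference.
    static String escapePerCodePoint(String text) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int codePoint = text.codePointAt(i);
            if (codePoint < 0x80) {
                out.append((char) codePoint);
            } else {
                out.append("&#").append(codePoint).append(';');
            }
            i += Character.charCount(codePoint);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // U+1D49C, the example character from the issue description.
        String scriptA = new String(Character.toChars(0x1D49C));
        System.out.println(escapePerChar(scriptA));      // &#55349;&#56476;
        System.out.println(escapePerCodePoint(scriptA)); // &#119964;
    }
}
```

The second method relies only on String.codePointAt and Character.charCount from the JDK; whether a fix in ToStream would take the same shape is an assumption.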

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira