Posted to issues@commons.apache.org by "Taro Yabuki (JIRA)" <ji...@apache.org> on 2011/07/16 10:11:59 UTC

[jira] [Updated] (LANG-728) StringEscapeUtils.escapeXml(str) does not support supplemental characters.

     [ https://issues.apache.org/jira/browse/LANG-728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Taro Yabuki updated LANG-728:
-----------------------------

    Attachment: lang_2_6_escapexml_20110716.diff

Test code and patch for org/apache/commons/lang/Entities.java.

> StringEscapeUtils.escapeXml(str) does not support supplemental characters.
> --------------------------------------------------------------------------
>
>                 Key: LANG-728
>                 URL: https://issues.apache.org/jira/browse/LANG-728
>             Project: Commons Lang
>          Issue Type: Bug
>          Components: lang.*
>    Affects Versions: 2.6
>            Reporter: Taro Yabuki
>            Priority: Minor
>              Labels: patch
>         Attachments: lang_2_6_escapexml_20110716.diff
>
>
> Hello.
> StringEscapeUtils.escapeXml(str) escapes Unicode characters greater than 0x7f to numeric character references, but it does so per UTF-16 code unit, so a surrogate pair yields two references:
> String str = StringEscapeUtils.escapeXml("\uD84C\uDFB4");
> System.out.println(str);
> //&#55372;&#57268;
> However, the output should be &#144308; (the single code point U+233B4).
> According to the W3C document "Using character escapes in markup and CSS," we must use the single code point value for a supplementary character.
> http://www.w3.org/International/questions/qa-escapes
> In fact, &#55372;&#57268; is not rendered correctly in some web browsers, e.g., Firefox 5.0 and Chrome 12.0.
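
For illustration, the behaviour requested above can be sketched by iterating over code points instead of UTF-16 code units. This is only a minimal sketch, not the attached patch and not the Commons Lang implementation: the class SupplementaryEscapeSketch and the method escapeNonAscii are hypothetical names, and the sketch deliberately ignores the named entities (&amp;, &lt;, and so on) that escapeXml also handles.

```java
class SupplementaryEscapeSketch {

    // Hypothetical helper: writes every code point above 0x7F as one
    // decimal numeric character reference. A surrogate pair is combined
    // into a single code point before it is written.
    static String escapeNonAscii(String input) {
        StringBuilder out = new StringBuilder(input.length() * 2);
        int i = 0;
        while (i < input.length()) {
            // codePointAt joins a high/low surrogate pair into one value.
            int codePoint = input.codePointAt(i);
            if (codePoint > 0x7F) {
                out.append("&#").append(codePoint).append(';');
            } else {
                out.append((char) codePoint);
            }
            // Advance by 2 chars for a supplementary character, else by 1.
            i += Character.charCount(codePoint);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // The surrogate pair from the report, i.e. U+233B4.
        System.out.println(escapeNonAscii("\uD84C\uDFB4")); // prints &#144308;
    }
}
```

The only real change from a char-by-char loop is the use of String.codePointAt and Character.charCount, both available since Java 5, so that the pair D84C DFB4 is emitted as one reference instead of two.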

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira