You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@commons.apache.org by "Henri Yandell (JIRA)" <ji...@apache.org> on 2011/07/19 07:04:00 UTC

[jira] [Reopened] (LANG-728) StringEscapeUtils.escapeXml(str) does not support supplemental characters.

     [ https://issues.apache.org/jira/browse/LANG-728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Henri Yandell reopened LANG-728:
--------------------------------


Reopening as I explained things badly. 

> StringEscapeUtils.escapeXml(str) does not support supplemental characters.
> --------------------------------------------------------------------------
>
>                 Key: LANG-728
>                 URL: https://issues.apache.org/jira/browse/LANG-728
>             Project: Commons Lang
>          Issue Type: Bug
>          Components: lang.*
>    Affects Versions: 2.6
>            Reporter: Taro Yabuki
>            Priority: Minor
>              Labels: patch
>             Fix For: 3.0.1
>
>         Attachments: lang_2_6_escapexml_20110716.diff
>
>
> Hello.
> StringEscapeUtils.escapeXml(str) escapes Unicode characters greater than 0x7f to their numerical \\u equivalent:
> String str = StringEscapeUtils.escapeXml("\uD84C\uDFB4");
> System.out.println(str);
> //&#55372;&#57268;
> But, the output should be &#144308;.
> According to W3C document "Using character escapes in markup and CSS," We must use the single, code point value for supplemental character.
> http://www.w3.org/International/questions/qa-escapes
> In fact, &#55372;&#57268; is not rendered correctly in some web browsers e.g., Firefox 5.0 and Chrome 12.0.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira