You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@commons.apache.org by "Gregor B. Rosenauer (JIRA)" <ji...@apache.org> on 2009/07/01 22:45:47 UTC

[jira] Commented: (LANG-507) StringEscapeUtils.unescapeJava should support \u+ notation

    [ https://issues.apache.org/jira/browse/LANG-507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12726221#action_12726221 ] 

Gregor B. Rosenauer commented on LANG-507:
------------------------------------------

I have implemented a local workaround for this, so I could add the patch if nobody else is already on it, will look into it in the next days.

> StringEscapeUtils.unescapeJava should support \u+ notation
> ----------------------------------------------------------
>
>                 Key: LANG-507
>                 URL: https://issues.apache.org/jira/browse/LANG-507
>             Project: Commons Lang
>          Issue Type: Improvement
>    Affects Versions: 2.4
>            Reporter: Gregor B. Rosenauer
>            Priority: Trivial
>             Fix For: 3.0
>
>
> Currently, when trying to unescape a String with Unicode escapes in the common notation, e.g., \u+0022, I get a NumberFormatException:
> org.apache.commons.lang.exception.NestableRuntimeException: Unable to parse unicode value: +002
> Note that the number is also parsed incorrectly as it is shortened by one character (obviously, the parser gets confused by the '+' and only takes up to 4 bytes, so it neglects the last digit).
> I am aware that in Java, Unicode is escaped as "\u" followed by 4 bytes that represent the hex code in the Unicode map, but the \u+ notation is commonly used outside the Java world and it would be very handy if StringEscapeUtils supported that, at least as an option.
> Would you please consider adding this feature to 3.0?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.