Posted to dev@olingo.apache.org by "Michael Bolz (JIRA)" <ji...@apache.org> on 2015/06/09 16:13:02 UTC

[jira] [Commented] (OLINGO-689) Olingo 2 unicode name suport

    [ https://issues.apache.org/jira/browse/OLINGO-689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14578930#comment-14578930 ] 

Michael Bolz commented on OLINGO-689:
-------------------------------------

Hi [~Sebastien],

I'm currently evaluating this issue, and I think the first and second ({{\\x10000-\\xEFFFF}}) points you mention are both bugs.
For the second one in particular, I have to re-check which Unicode characters are allowed in the OData specification.

However, I have a short question about your test class.
When I run it with JDK 6, I get the following exception:
{code}
Caused by: java.util.regex.PatternSyntaxException: Illegal hexadecimal escape sequence near index 154
^[:A-Z_a-z\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u02ff\u0370-\u037d\u037f-\u1fff\u200c\u200d\u2070-\u218f\u2c00-\u2fef\u3001-\ud7ff\uf900-\ufdcf\ufdf0-\ufffd\x{10000}-\x{EFFFF}][:A-Z_a-z\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u02ff\u0370-\u037d\u037f-\u1fff\u200c\u200d\u2070-\u218f\u2c00-\u2fef\u3001-\udfff\x{10000}-\x{EFFFF}\uf900-\ufdcf\ufdf0-\ufffd\-\.0-9\u00b7\u0300-\u036f\u203f-\u2040]*\Z
{code}

Which Java version are you using?
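
For context on the exception above: the braced form {{\x{h...h}}} is documented for java.util.regex.Pattern from Java 7 onward, which would explain why JDK 6 rejects it. The following is only a minimal sketch, not the Olingo code: the class and method names (SupplementaryRangeDemo, literalRange, bracedHexSupported) are made up for illustration. It shows one possible way to express the U+10000 to U+EFFFF range without the braced escape, by putting the literal surrogate pairs into the character class.

```java
import java.util.regex.Pattern;
import java.util.regex.PatternSyntaxException;

// Hypothetical demo class; not part of Olingo.
final class SupplementaryRangeDemo {

    // Builds "[<U+10000>-<U+EFFFF>]" from literal supplementary characters,
    // so the pattern does not rely on the \x{h...h} escape.
    static Pattern literalRange() {
        String low = new String(Character.toChars(0x10000));
        String high = new String(Character.toChars(0xEFFFF));
        return Pattern.compile("[" + low + "-" + high + "]");
    }

    // Reports whether the running JDK accepts the braced hexadecimal escape.
    static boolean bracedHexSupported() {
        try {
            Pattern.compile("[\\x{10000}-\\x{EFFFF}]");
            return true;
        } catch (PatternSyntaxException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // U+1F600 lies inside the supplementary range, 'A' does not.
        String supplementary = new String(Character.toChars(0x1F600));
        System.out.println("braced escape supported: " + bracedHexSupported());
        System.out.println("U+1F600 in range: " + literalRange().matcher(supplementary).matches());
        System.out.println("'A' in range: " + literalRange().matcher("A").matches());
    }
}
```

Whether the supplementary range should be allowed at all still depends on the OData specification check mentioned above; the sketch only addresses the syntax question.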

Best regards,
Michael

> Olingo 2 unicode name suport
> ----------------------------
>
>                 Key: OLINGO-689
>                 URL: https://issues.apache.org/jira/browse/OLINGO-689
>             Project: Olingo
>          Issue Type: Bug
>          Components: odata2-core
>    Affects Versions: V2 2.0.4
>         Environment: Windows 7 64 bit
>            Reporter: Sébastien Lévêque
>            Assignee: Michael Bolz
>         Attachments: Test.java
>
>
> It seems that there are some issues with the regular expression in class EdmNamedImplProv.
>  
> The first one is about the \\u00C0\\u00D6 part. It should be \\u00C0-\\u00D6, as that range contains only alphabetic characters.
> With the current implementation, Õ and Ä can’t be used, but Ö and À can.
>  
> The second issue is about \\x10000-\\xEFFFF, a range that is only available for the first character but not for the following ones (another bug?).
> \x????? is not supported in regular expressions.
> It should be \x?? (exactly 2 hex digits) or \x{?????}.
> This part is split into \\x10, 0 (twice), the range 0 to \\xEF, and F (3 times).
> This makes some characters valid in the first position, such as numeric characters or the "×" (\u00D7) multiplication sign.
> But the beginning of the expression contains \\u00C0\\u00D6\\u00D8-\\u00F6, which means that \u00D7 should not be allowed.
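
The two points in the description can be reproduced in isolation. The sketch below is an illustration under assumptions, not the actual EdmNamedImplProv code: the class name NameStartCharDemo and its fields are hypothetical, and each pattern contains only the fragment under discussion rather than the full expression.

```java
import java.util.regex.Pattern;

// Hypothetical demo class; the patterns are fragments of the reported expression.
final class NameStartCharDemo {

    // As reported: no hyphen, so only U+00C0 and U+00D6 themselves are accepted.
    static final Pattern LISTED = Pattern.compile("[\\u00C0\\u00D6]");

    // As proposed: the whole range U+00C0 to U+00D6.
    static final Pattern RANGE = Pattern.compile("[\\u00C0-\\u00D6]");

    // \x takes exactly two hex digits, so this class is read as
    // \x10, '0', '0', the range '0' to \xEF, and 'F' three times.
    static final Pattern BROKEN_X = Pattern.compile("[\\x10000-\\xEFFFF]");

    static boolean matches(Pattern pattern, String input) {
        return pattern.matcher(input).matches();
    }

    public static void main(String[] args) {
        // First issue: characters between the two end points are rejected.
        System.out.println(matches(LISTED, "\u00C4")); // false
        System.out.println(matches(RANGE, "\u00C4"));  // true

        // Second issue: digits and U+00D7 are accepted, U+10000 is not.
        System.out.println(matches(BROKEN_X, "5"));      // true
        System.out.println(matches(BROKEN_X, "\u00D7")); // true
        System.out.println(matches(BROKEN_X, new String(Character.toChars(0x10000)))); // false
    }
}
```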


