You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@avro.apache.org by "Thiruvalluvan M. G. (JIRA)" <ji...@apache.org> on 2018/04/01 16:19:00 UTC

[jira] [Updated] (AVRO-1191) C++ json encoder uses \U instead of \u

     [ https://issues.apache.org/jira/browse/AVRO-1191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thiruvalluvan M. G. updated AVRO-1191:
--------------------------------------
    Fix Version/s: 1.8.3
           Status: Patch Available  (was: Open)

> C++ json encoder uses \U instead of \u
> --------------------------------------
>
>                 Key: AVRO-1191
>                 URL: https://issues.apache.org/jira/browse/AVRO-1191
>             Project: Avro
>          Issue Type: Bug
>          Components: c++
>    Affects Versions: 1.7.0
>            Reporter: Keh-Li Sheng
>            Priority: Major
>             Fix For: 1.8.3
>
>
> From the JSON spec http://www.ietf.org/rfc/rfc4627.txt:
> {quote}
> 2.5.  Strings
>    The representation of strings is similar to conventions used in the C
>    family of programming languages.  A string begins and ends with
>    quotation marks.  All Unicode characters may be placed within the
>    quotation marks except for the characters that must be escaped:
>    quotation mark, reverse solidus, and the control characters (U+0000
>    through U+001F).
>    Any character may be escaped.  If the character is in the Basic
>    Multilingual Plane (U+0000 through U+FFFF), then it may be
>    represented as a six-character sequence: a reverse solidus, followed
>    by the lowercase letter u, followed by four hexadecimal digits that
>    encode the character's code point.  The hexadecimal letters A though
>    F can be upper or lowercase.  So, for example, a string containing
>    only a single reverse solidus character may be represented as
>    "\u005C".
> {quote}
> But JsonCodec.cc outputs \U



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)