You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ignite.apache.org by "Alexei Scherbakov (JIRA)" <ji...@apache.org> on 2016/05/10 16:19:12 UTC

[jira] [Commented] (IGNITE-3098) UTF-16 surrogate pairs are not properly serialized by BinaryMarshaller

    [ https://issues.apache.org/jira/browse/IGNITE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15278383#comment-15278383 ] 

Alexei Scherbakov commented on IGNITE-3098:
-------------------------------------------

Pull request https://github.com/apache/ignite/pull/699
Waiting for test results.

> UTF-16 surrogate pairs are not properly serialized by BinaryMarshaller
> ----------------------------------------------------------------------
>
>                 Key: IGNITE-3098
>                 URL: https://issues.apache.org/jira/browse/IGNITE-3098
>             Project: Ignite
>          Issue Type: Bug
>            Reporter: Denis Magda
>            Assignee: Alexei Scherbakov
>              Labels: important
>         Attachments: StringTest.java
>
>
> BinaryMarshaller is unable to properly serialize surrogate symbols (read here [1] paragraph "invalid code points") because it relies on String.getBytes("UTF-8") method [2].
> However Optimized and JDK marshalers can properly handle this symbols since they rely on {{ObjectOutputStream.writeUTF()}} method.
> [1] https://en.wikipedia.org/wiki/UTF-8
> [2] https://community.oracle.com/thread/1164397?start=0&tstart=0



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)