Posted to commits@cassandra.apache.org by "Jonathan Ellis (JIRA)" <ji...@apache.org> on 2010/07/08 17:29:52 UTC

[jira] Commented: (CASSANDRA-1235) BytesType and batch mutate causes encoded bytes of non-printable characters to be dropped

    [ https://issues.apache.org/jira/browse/CASSANDRA-1235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12886346#action_12886346 ] 

Jonathan Ellis commented on CASSANDRA-1235:
-------------------------------------------

I believe that

                return new String(buffer, 0, len);

will decode buffer using the platform's default charset, which is not necessarily UTF-8.  You want

                return new String(buffer, 0, len, "UTF8");

I'm not at all sure that longToPrefixCoded is going to generate valid UTF-8, either.
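
To make both concerns concrete, here is a minimal sketch (not the code from the attached test, and not a diagnosis of this bug). The class and method names are hypothetical. It decodes with an explicit charset and then checks whether a byte array survives a decode/encode round trip through a UTF-8 String. Charset.forName is used rather than a string charset name only to avoid the checked UnsupportedEncodingException; it is available on Java 1.6.

```java
import java.nio.charset.Charset;
import java.util.Arrays;

// Hypothetical helper class, for illustration only.
final class KeyDecodingSketch {
    private static final Charset UTF8 = Charset.forName("UTF-8");

    // Decode the first len bytes of buffer with an explicit charset,
    // so the result does not depend on the JVM's configuration.
    public static String decodeUtf8(byte[] buffer, int len) {
        return new String(buffer, 0, len, UTF8);
    }

    // Returns true only if raw is unchanged after bytes -> String -> bytes.
    // Malformed UTF-8 input is replaced during decoding, so arbitrary
    // binary keys generally do not survive this round trip.
    public static boolean survivesUtf8RoundTrip(byte[] raw) {
        String decoded = new String(raw, UTF8);
        byte[] encoded = decoded.getBytes(UTF8);
        return Arrays.equals(raw, encoded);
    }

    public static void main(String[] args) {
        byte[] text = "key-1".getBytes(UTF8);
        // Assumed example of a non-text key: 0x80 and 0xFF are not valid UTF-8.
        byte[] binary = { (byte) 0x80, (byte) 0xFF, 0x01 };

        System.out.println(survivesUtf8RoundTrip(text));   // true
        System.out.println(survivesUtf8RoundTrip(binary)); // false
    }
}
```

If the output of longToPrefixCoded fails a check like this, the key should probably be kept as a byte[] end to end rather than passed through a String.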

> BytesType and batch mutate causes encoded bytes of non-printable characters to be dropped
> -----------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-1235
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1235
>             Project: Cassandra
>          Issue Type: Bug
>    Affects Versions: 0.6
>         Environment: Java 1.6 sun JDK 
> Java(TM) SE Runtime Environment (build 1.6.0_20-b02)
> Java HotSpot(TM) 64-Bit Server VM (build 16.3-b01, 
> Ubuntu 10.04 64 bit
>            Reporter: Todd Nine
>            Priority: Critical
>             Fix For: 0.6.4
>
>         Attachments: TestEncodedKeys.java
>
>
> When running the two tests, an individual column insert works with the generated values.  However, a batch insert with the same values causes an encoding failure on the key.  It appears that bytes are dropped from the end of the byte array that represents the key value.  See the attached unit test.
