You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Ryan Blue (JIRA)" <ji...@apache.org> on 2015/05/16 22:41:59 UTC

[jira] [Created] (PARQUET-286) Avro object model should use Utf8

Ryan Blue created PARQUET-286:
---------------------------------

             Summary: Avro object model should use Utf8
                 Key: PARQUET-286
                 URL: https://issues.apache.org/jira/browse/PARQUET-286
             Project: Parquet
          Issue Type: Improvement
          Components: parquet-avro
    Affects Versions: 1.6.0
            Reporter: Ryan Blue
            Assignee: Ryan Blue
             Fix For: 2.0.0


The parquet-avro object model currently only uses String for binary/utf8 data. It should use Utf8, which doesn't copy data and require 2x the space whenever Avro would. That is by default in generic, unless avro.java.string is set to "string". Avro reflect always uses String, and both reflect and specific will respect the java-class property for stringable types.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)