You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@avro.apache.org by "Howard Tong (JIRA)" <ji...@apache.org> on 2018/07/06 05:31:00 UTC

[jira] [Created] (AVRO-2197) add a new constructor in DataFileReader & DataFileWriter with passing in Schema object

Howard Tong created AVRO-2197:
---------------------------------

             Summary: add a new constructor in DataFileReader & DataFileWriter with passing in Schema object
                 Key: AVRO-2197
                 URL: https://issues.apache.org/jira/browse/AVRO-2197
             Project: Avro
          Issue Type: Improvement
          Components: java
    Affects Versions: 1.8.2
            Reporter: Howard Tong


When processing big number of small files (with the same schema header), reading/writing each file need to read/write the head and parsing the json script, which is very slow. 

If adding a new constructor in DataFileReader and DataFileWriter that allows pass in already parsed Schema object/script,  then it will greatly improve the reading/writing performance for such cases. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)