Posted to issues@drill.apache.org by "Julien Le Dem (JIRA)" <ji...@apache.org> on 2015/10/23 20:47:27 UTC

[jira] [Created] (DRILL-3972) Vectorize Parquet Writer

Julien Le Dem created DRILL-3972:
------------------------------------

             Summary: Vectorize Parquet Writer
                 Key: DRILL-3972
                 URL: https://issues.apache.org/jira/browse/DRILL-3972
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Parquet
            Reporter: Julien Le Dem


Currently the [ParquetRecordWriter|https://github.com/apache/drill/blob/a98da39dd5a8fa368afd8765f4e981826bbfcc0f/exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/ParquetRecordWriter.java] receives one record at a time and then turns each record into column values.
This means we convert from Drill columns to rows, and then from rows to Parquet columns.
Instead, we could convert the Drill columns directly into Parquet columns in a vectorized manner.
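
To illustrate the difference, here is a minimal sketch in plain Java. It does not use the real Drill or Parquet classes: IntColumnSink, writeRowWise, writeColumnWise and newSinks are hypothetical names introduced only for this example, and the columns are simplified to non-nullable int arrays of equal length. It contrasts the row pivot described above with a direct column-to-column copy.

```java
import java.util.ArrayList;
import java.util.List;

public class VectorizedWriteSketch {

    /** Hypothetical stand-in for a per-column Parquet writer (not the real Parquet API). */
    static final class IntColumnSink {
        final List<Integer> values = new ArrayList<>();

        void write(int v) {
            values.add(v);
        }
    }

    /** Creates one empty sink per column. */
    static IntColumnSink[] newSinks(int columnCount) {
        IntColumnSink[] sinks = new IntColumnSink[columnCount];
        for (int c = 0; c < columnCount; c++) {
            sinks[c] = new IntColumnSink();
        }
        return sinks;
    }

    /** Row-at-a-time path: gather one record from all columns, then scatter it to the sinks. */
    static void writeRowWise(int[][] columns, IntColumnSink[] sinks) {
        int rowCount = columns.length == 0 ? 0 : columns[0].length;
        int[] row = new int[columns.length];
        for (int r = 0; r < rowCount; r++) {
            // Columns -> row: the intermediate step the issue proposes to remove.
            for (int c = 0; c < columns.length; c++) {
                row[c] = columns[c][r];
            }
            // Row -> columns again.
            for (int c = 0; c < columns.length; c++) {
                sinks[c].write(row[c]);
            }
        }
    }

    /** Column-at-a-time path: copy each source column straight into its sink. */
    static void writeColumnWise(int[][] columns, IntColumnSink[] sinks) {
        for (int c = 0; c < columns.length; c++) {
            for (int v : columns[c]) {
                sinks[c].write(v);
            }
        }
    }
}
```

Both paths leave the same values in each sink; the column-wise version skips the intermediate row and walks each column in a tight inner loop.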

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)