You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Julien Le Dem (JIRA)" <ji...@apache.org> on 2015/10/23 20:47:27 UTC
[jira] [Created] (DRILL-3972) Vectorize Parquet Writer
Julien Le Dem created DRILL-3972:
------------------------------------
Summary: Vectorize Parquet Writer
Key: DRILL-3972
URL: https://issues.apache.org/jira/browse/DRILL-3972
Project: Apache Drill
Issue Type: Bug
Components: Storage - Parquet
Reporter: Julien Le Dem
Currently the [ParquetRecordWriter|https://github.com/apache/drill/blob/a98da39dd5a8fa368afd8765f4e981826bbfcc0f/exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/ParquetRecordWriter.java] receives one record at a time and then turns that into columns.
Which means we convert from Drill columns to rows and then to Parquet columns.
Instead we could directly convert the Drill columns into Parquet columns in a vectorized manner.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)