You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@hive.apache.org by "Dong Chen (JIRA)" <ji...@apache.org> on 2015/04/08 07:02:12 UTC

[jira] [Updated] (HIVE-10016) Remove duplicated Hive table schema parsing in DataWritableReadSupport

     [ https://issues.apache.org/jira/browse/HIVE-10016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dong Chen updated HIVE-10016:
-----------------------------
    Attachment: HIVE-10016.patch

Rebased to trunk. There are a little changes for the patch to resolve conflict.

> Remove duplicated Hive table schema parsing in DataWritableReadSupport
> ----------------------------------------------------------------------
>
>                 Key: HIVE-10016
>                 URL: https://issues.apache.org/jira/browse/HIVE-10016
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Dong Chen
>            Assignee: Dong Chen
>         Attachments: HIVE-10016-parquet.patch, HIVE-10016.1-parquet.patch, HIVE-10016.patch
>
>
> In {{DataWritableReadSupport.init()}}, the table schema is created and its string format is set in conf. When construct the {{ParquetRecordReaderWrapper}} , the schema is fetched from conf and parsed several times.
> We could remove these schema parsing, and improve the speed of getRecordReader  a bit.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)