You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Ferdinand Xu (JIRA)" <ji...@apache.org> on 2016/09/22 07:14:20 UTC

[jira] [Updated] (HIVE-14815) Support vectorization for Parquet

     [ https://issues.apache.org/jira/browse/HIVE-14815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ferdinand Xu updated HIVE-14815:
--------------------------------
    Description: Parquet doesn't provide a vectorized reader which can be used by Hive directly. Also for Decimal Column batch, it consists of a batch of HiveDecimal which is a Hive type which is unknown for Parquet. To support Hive vectorization execution engine in Hive, we have to implement the vectorized Parquet reader in Hive side. To limit the performance impacts, we need to implement a page level vectorized reader.

> Support vectorization for Parquet
> ---------------------------------
>
>                 Key: HIVE-14815
>                 URL: https://issues.apache.org/jira/browse/HIVE-14815
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Ferdinand Xu
>            Assignee: Ferdinand Xu
>
> Parquet doesn't provide a vectorized reader which can be used by Hive directly. Also for Decimal Column batch, it consists of a batch of HiveDecimal which is a Hive type which is unknown for Parquet. To support Hive vectorization execution engine in Hive, we have to implement the vectorized Parquet reader in Hive side. To limit the performance impacts, we need to implement a page level vectorized reader.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)