Posted to dev@hive.apache.org by "Daniel Weeks (JIRA)" <ji...@apache.org> on 2014/08/20 18:06:25 UTC

[jira] [Updated] (HIVE-7800) Parquet Column Index Access Schema Size Checking

     [ https://issues.apache.org/jira/browse/HIVE-7800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Daniel Weeks updated HIVE-7800:
-------------------------------

    Status: Patch Available  (was: Open)

The included patch is a trivial fix: it checks both that the column exists in the Parquet file and that the column's index position is one the file's schema actually contains.

If either check fails, the column is not included and null values are produced for the missing column, which is the expected behavior.
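
For readers without the patch at hand, the idea can be sketched roughly as follows. This is not the code from HIVE-7800.1.patch: it uses plain Java lists instead of Parquet or Hive classes, and the class name ColumnIndexProjection, the method project, and the sample field names are all hypothetical. It only illustrates bounds-checking a requested column index against one file's schema and yielding null for positions that file does not have.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ColumnIndexProjection {

    /**
     * Hypothetical helper (not a Hive or Parquet API). For each requested
     * column index, returns the field name found at that position in the
     * file's schema, or null when the file has no such position.
     */
    static List<String> project(List<String> fileFields, List<Integer> requestedIndexes) {
        List<String> projected = new ArrayList<>();
        for (int index : requestedIndexes) {
            if (index >= 0 && index < fileFields.size()) {
                // The position exists in this file's schema.
                projected.add(fileFields.get(index));
            } else {
                // The file's schema is shorter than the table's schema:
                // record a null instead of throwing an out-of-bounds error.
                projected.add(null);
            }
        }
        return projected;
    }

    public static void main(String[] args) {
        // Assumed scenario: an older partition file written with two columns,
        // read through a table definition that now has three.
        List<String> olderFile = Arrays.asList("id", "name");
        System.out.println(project(olderFile, Arrays.asList(0, 2))); // prints [id, null]
    }
}
```

Without the range check, the lookup at index 2 would throw an IndexOutOfBoundsException for the two-column file, which is the failure described in the issue.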

> Parquet Column Index Access Schema Size Checking
> ------------------------------------------------
>
>                 Key: HIVE-7800
>                 URL: https://issues.apache.org/jira/browse/HIVE-7800
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 0.14.0
>            Reporter: Daniel Weeks
>            Assignee: Daniel Weeks
>         Attachments: HIVE-7800.1.patch
>
>
> When a Parquet-formatted table has partitions whose files have schemas of different sizes, using column index access can result in an index-out-of-bounds exception.



--
This message was sent by Atlassian JIRA
(v6.2#6252)