You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Vitalii Diravka (JIRA)" <ji...@apache.org> on 2018/07/13 17:28:00 UTC
[jira] [Commented] (DRILL-4337) Drill fails to read INT96 fields
from hive generated parquet files
[ https://issues.apache.org/jira/browse/DRILL-4337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16543482#comment-16543482 ]
Vitalii Diravka commented on DRILL-4337:
----------------------------------------
I have reproduced the issue only with dataset from DRILL-5495. The issue is solved in context of that Jira.
> Drill fails to read INT96 fields from hive generated parquet files
> ------------------------------------------------------------------
>
> Key: DRILL-4337
> URL: https://issues.apache.org/jira/browse/DRILL-4337
> Project: Apache Drill
> Issue Type: Bug
> Reporter: Rahul Challapalli
> Assignee: Vitalii Diravka
> Priority: Blocker
> Fix For: 1.14.0
>
> Attachments: hive1_fewtypes_null.parquet
>
>
> git.commit.id.abbrev=576271d
> Cluster : 2 nodes running MaprFS 4.1
> The data file used in the below table is generated from hive. Below is output from running the same query multiple times.
> {code}
> 0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
> Error: SYSTEM ERROR: NegativeArraySizeException
> Fragment 0:0
> [Error Id: 5517e983-ccae-4c96-b09c-30f331919e56 on qa-node191.qa.lab:31010] (state=,code=0)
> 0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
> Error: SYSTEM ERROR: IllegalArgumentException: Reading past RLE/BitPacking stream.
> Fragment 0:0
> [Error Id: 94ed5996-d2ac-438d-b460-c2d2e41bdcc3 on qa-node191.qa.lab:31010] (state=,code=0)
> 0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
> Error: SYSTEM ERROR: ArrayIndexOutOfBoundsException: 0
> Fragment 0:0
> [Error Id: 41dca093-571e-49e5-a2ab-fd69210b143d on qa-node191.qa.lab:31010] (state=,code=0)
> 0: jdbc:drill:zk=10.10.100.190:5181> select timestamp_col from hive1_fewtypes_null;
> +----------------+
> | timestamp_col |
> +----------------+
> | null |
> | [B@7c766115 |
> | [B@3fdfe989 |
> | null |
> | [B@55d4222 |
> | [B@2da0c8ee |
> | [B@16e798a9 |
> | [B@3ed78afe |
> | [B@38e649ed |
> | [B@16ff83ca |
> | [B@61254e91 |
> | [B@5849436a |
> | [B@31e9116e |
> | [B@3c77665b |
> | [B@42e0ff60 |
> | [B@419e19ed |
> | [B@72b83842 |
> | [B@1c75afe5 |
> | [B@726ef1fb |
> | [B@51d0d06e |
> | [B@64240fb8 |
> +----------------
> {code}
> Attached the log, hive ddl used to generate the parquet file and the parquet file itself
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)