You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Igor Guzenko (Jira)" <ji...@apache.org> on 2019/10/21 09:20:00 UTC
[jira] [Assigned] (DRILL-1999) Drill should expose the Parquet
logical schema rather than the physical schema
[ https://issues.apache.org/jira/browse/DRILL-1999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Igor Guzenko reassigned DRILL-1999:
-----------------------------------
Assignee: Igor Guzenko
> Drill should expose the Parquet logical schema rather than the physical schema
> ------------------------------------------------------------------------------
>
> Key: DRILL-1999
> URL: https://issues.apache.org/jira/browse/DRILL-1999
> Project: Apache Drill
> Issue Type: Improvement
> Components: Storage - Parquet
> Reporter: Ramana Inukonda Nagaraj
> Assignee: Igor Guzenko
> Priority: Major
> Fix For: Future
>
> Attachments: hive_alltypes.parquet
>
>
> Created a parquet file in hive having the following DDL
> hive> desc alltypesparquet;
> OK
> c1 int
> c2 boolean
> c3 double
> c4 string
> c5 array<int>
> c6 map<int,string>
> c7 map<string,string>
> c8 struct<r:string,s:int,t:double>
> c9 tinyint
> c10 smallint
> c11 float
> c12 bigint
> c13 array<array<string>>
> c15 struct<r:int,s:struct<a:int,b:string>>
> c16 array<struct<m:map<string,string>,n:int>>
> Time taken: 0.076 seconds, Fetched: 15 row(s)
> column5 which is an array of integers shows up as a bag when querying through drill
> 0: jdbc:drill:> select c5 from `/user/hive/warehouse/alltypesparquet`;
> +------------+
> | c5 |
> +------------+
> | {"bag":[]} |
> | {"bag":[]} |
> | {"bag":[{"array_element":1},{"array_element":2}]} |
> +------------+
> 3 rows selected (0.085 seconds)
> While from hive
> hive> select c5 from alltypesparquet;
> OK
> NULL
> NULL
> [1,2]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)