You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Igor Guzenko (Jira)" <ji...@apache.org> on 2019/10/21 09:21:00 UTC

[jira] [Updated] (DRILL-1999) Drill should expose the Parquet logical schema rather than the physical schema

     [ https://issues.apache.org/jira/browse/DRILL-1999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Igor Guzenko updated DRILL-1999:
--------------------------------
    Fix Version/s:     (was: Future)
                   1.17.0

> Drill should expose the Parquet logical schema rather than the physical schema
> ------------------------------------------------------------------------------
>
>                 Key: DRILL-1999
>                 URL: https://issues.apache.org/jira/browse/DRILL-1999
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Storage - Parquet
>            Reporter: Ramana Inukonda Nagaraj
>            Assignee: Igor Guzenko
>            Priority: Major
>             Fix For: 1.17.0
>
>         Attachments: hive_alltypes.parquet
>
>
> Created a parquet file in hive having the following DDL
> hive> desc alltypesparquet; 
> OK
> c1 int 
> c2 boolean 
> c3 double 
> c4 string 
> c5 array<int> 
> c6 map<int,string> 
> c7 map<string,string> 
> c8 struct<r:string,s:int,t:double>	
> c9 tinyint 
> c10 smallint 
> c11 float 
> c12 bigint 
> c13 array<array<string>>	
> c15 struct<r:int,s:struct<a:int,b:string>>	
> c16 array<struct<m:map<string,string>,n:int>>	
> Time taken: 0.076 seconds, Fetched: 15 row(s)
> column5 which is an array of integers shows up as a bag when querying through drill 
> 0: jdbc:drill:> select c5 from `/user/hive/warehouse/alltypesparquet`;
> +------------+
> |     c5     |
> +------------+
> | {"bag":[]} |
> | {"bag":[]} |
> | {"bag":[{"array_element":1},{"array_element":2}]} |
> +------------+
> 3 rows selected (0.085 seconds)
> While from hive
> hive> select c5 from alltypesparquet;  
> OK
> NULL
> NULL
> [1,2]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)