You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Krystal (JIRA)" <ji...@apache.org> on 2014/10/10 20:48:34 UTC
[jira] [Created] (DRILL-1524) Data from hive parquet table is displayed as "null" when select all columns

Krystal created DRILL-1524:
------------------------------

             Summary: Data from hive parquet table is displayed as "null" when select all columns 
                 Key: DRILL-1524
                 URL: https://issues.apache.org/jira/browse/DRILL-1524
             Project: Apache Drill
          Issue Type: Bug
    Affects Versions: 0.6.0
            Reporter: Krystal


git.commit.id.abbrev=42f0a7e

>From hive-13, I created a parquet table:
hive> create table voter_parquet(voter_id int,name string,age tinyint, registration string,contributions float,voterzone smallint,create_time string) stored as parquet; 
hive> insert overwrite table voter_parquet select * from voter;

I can select against this table from hive:
hive> select * from voter_parquet limit 5;
OK
1	nick miller	68	green	717.12	13809	2014-05-25 03:41:54
2	ulysses white	48	green	840.06	19451	2014-07-30 08:03:11
3	holly garcia	18	democrat	128.2	8750	2014-09-15 02:33:11
4	victor thompson	61	independent	721.6	20462	2014-06-17 13:04:09
5	luke allen	39	socialist	800.22	25151	2015-02-01 02:02:37

I ran the same select from sqlline and got all nulls:

0: jdbc:drill:schema=hive> select * from voter_parquet limit 5;
+------------+------------+------------+--------------+---------------+------------+-------------+
|  voter_id  |    name    |    age     | registration | contributions | voterzone  | create_time |
+------------+------------+------------+--------------+---------------+------------+-------------+
| null       | null       | null       | null         | null          | null       | null        |
| null       | null       | null       | null         | null          | null       | null        |
| null       | null       | null       | null         | null          | null       | null        |
| null       | null       | null       | null         | null          | null       | null        |
| null       | null       | null       | null         | null          | null       | null        |
+------------+------------+------------+--------------+---------------+------------+-------------+

Same if I explicitly specify all the columns:
0: jdbc:drill:schema=hive> select voter_id, name, age, registration, contributions, voterzone, create_time from voter_parquet limit 2;
+------------+------------+------------+--------------+---------------+------------+-------------+
|  voter_id  |    name    |    age     | registration | contributions | voterzone  | create_time |
+------------+------------+------------+--------------+---------------+------------+-------------+
| null       | null       | null       | null         | null          | null       | null        |
| null       | null       | null       | null         | null          | null       | null        |
+------------+------------+------------+--------------+---------------+------------+-------------+

However, if I select a few columns, then the data displays correctly:
0: jdbc:drill:schema=hive> select voter_id, name, age, registration from voter_parquet limit 5;
+------------+------------+------------+--------------+
|  voter_id  |    name    |    age     | registration |
+------------+------------+------------+--------------+
| 1          | nick miller | 68         | green        |
| 2          | ulysses white | 48         | green        |
| 3          | holly garcia | 18         | democrat     |
| 4          | victor thompson | 61         | independent  |
| 5          | luke allen | 39         | socialist    |
+------------+------------+------------+--------------+

0: jdbc:drill:schema=hive> describe voter_parquet;
+-------------+------------+-------------+
| COLUMN_NAME | DATA_TYPE  | IS_NULLABLE |
+-------------+------------+-------------+
| voter_id    | INTEGER    | YES         |
| name        | VARCHAR    | YES         |
| age         | TINYINT    | YES         |
| registration | VARCHAR    | YES         |
| contributions | FLOAT      | YES         |
| voterzone   | SMALLINT   | YES         |
| create_time | VARCHAR    | YES         |
+-------------+------------+-------------+




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)