You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hive.apache.org by Krzysztof Adamski <ad...@gmail.com> on 2015/08/31 15:37:22 UTC

Hive serde parquet - problem with case sensitive column names in metastore

Hi guys,

I have a problem after the latest hortonworks upgrade (probably due to
https://issues.apache.org/jira/browse/HIVE-7554)

I have a lot of parquet files with a column name request_URL.
However a hive now stores the table with lowercase column names in
metastore and I am unable to query the tables.

Any ideas how to fix that? SERDEPROPERTIES?

Error: java.io.IOException: java.lang.IllegalArgumentException: Column
[request_url] was not found in schema! (state=,code=0)

Tried with the following serde property, but it doesn't work
WITH SERDEPROPERTIES
 ('request_url'='request_URL')

 CREATE EXTERNAL TABLE `test_table`(
   `timestamp` bigint,
   `source_ip` int,
   `request_url` string,
   `protocol` string
   )
 PARTITIONED BY (
   `year` string,
   `month` string,
   `day` string)
 ROW FORMAT SERDE
   'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe'
WITH SERDEPROPERTIES
 ('request_url'='request_URL')
 STORED AS INPUTFORMAT
   'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat'
 OUTPUTFORMAT
   'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'
 LOCATION
   'hdfs://cluster/logparquet/';

Thanks.
Krzysztof