You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Aaron Dossett (JIRA)" <ji...@apache.org> on 2015/09/28 20:26:04 UTC
[jira] [Created] (HIVE-11977) Hive should handle an external avro
table with zero length files present
Aaron Dossett created HIVE-11977:
------------------------------------
Summary: Hive should handle an external avro table with zero length files present
Key: HIVE-11977
URL: https://issues.apache.org/jira/browse/HIVE-11977
Project: Hive
Issue Type: Bug
Reporter: Aaron Dossett
If a zero length file is in the top level directory housing an external avro table, all hive queries on the table fail.
This issue is that org.apache.hadoop.hive.ql.io.avro.AvroGenericRecordReader creates a new org.apache.avro.file.DataFileReader and DataFileReader throws an exception when trying to read an empty file (because the empty file lacks the magic number marking it as avro).
AvroGenericRecordReader should detect an empty file and then behave reasonably.
Caused by: java.io.IOException: Not a data file.
at org.apache.avro.file.DataFileStream.initialize(DataFileStream.java:102)
at org.apache.avro.file.DataFileReader.<init>(DataFileReader.java:97)
at org.apache.hadoop.hive.ql.io.avro.AvroGenericRecordReader.<init>(AvroGenericRecordReader.java:81)
at org.apache.hadoop.hive.ql.io.avro.AvroContainerInputFormat.getRecordReader(AvroContainerInputFormat.java:51)
at org.apache.hadoop.hive.ql.io.HiveInputFormat.getRecordReader(HiveInputFormat.java:246)
... 25 more
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)