You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Prasanth Jayachandran (JIRA)" <ji...@apache.org> on 2016/09/29 20:12:20 UTC
[jira] [Reopened] (HIVE-13284) Make ORC Reader resilient to 0
length files
[ https://issues.apache.org/jira/browse/HIVE-13284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Prasanth Jayachandran reopened HIVE-13284:
------------------------------------------
This could still be a problem if Reader api is directly used. Hive filters 0 length files in OrcInputFormat during split generation and validateInput but if someone uses Reader api directly this could still be an issue. Ideally this should be fixed in ORC project. I will keep this open anyways for tracking.
> Make ORC Reader resilient to 0 length files
> -------------------------------------------
>
> Key: HIVE-13284
> URL: https://issues.apache.org/jira/browse/HIVE-13284
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.1.0
> Reporter: Prasanth Jayachandran
> Assignee: Prasanth Jayachandran
>
> HIVE-13040 creates 0 length ORC files. Reading such files will throw following exception. ORC is resilient to corrupt footers but not 0 length files.
> {code}
> Processing data file file:/app/warehouse/concat_incompat/000000_0 [length: 0]
> Exception in thread "main" java.lang.IndexOutOfBoundsException
> at java.nio.Buffer.checkIndex(Buffer.java:540)
> at java.nio.HeapByteBuffer.get(HeapByteBuffer.java:139)
> at org.apache.hadoop.hive.ql.io.orc.ReaderImpl.extractMetaInfoFromFooter(ReaderImpl.java:510)
> at org.apache.hadoop.hive.ql.io.orc.ReaderImpl.<init>(ReaderImpl.java:361)
> at org.apache.hadoop.hive.ql.io.orc.OrcFile.createReader(OrcFile.java:83)
> at org.apache.hadoop.hive.ql.io.orc.FileDump.getReader(FileDump.java:239)
> at org.apache.hadoop.hive.ql.io.orc.FileDump.printMetaDataImpl(FileDump.java:312)
> at org.apache.hadoop.hive.ql.io.orc.FileDump.printMetaData(FileDump.java:291)
> at org.apache.hadoop.hive.ql.io.orc.FileDump.main(FileDump.java:138)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:497)
> at org.apache.hadoop.util.RunJar.run(RunJar.java:221)
> at org.apache.hadoop.util.RunJar.main(RunJar.java:136)
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)