Posted to issues@orc.apache.org by "Dongjoon Hyun (JIRA)" <ji...@apache.org> on 2017/08/18 20:42:00 UTC

[jira] [Commented] (ORC-233) Allow `orc.include.columns` to be empty

    [ https://issues.apache.org/jira/browse/ORC-233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16133599#comment-16133599 ] 

Dongjoon Hyun commented on ORC-233:
-----------------------------------

Hi, [~owen.omalley] and [~bikassaha].
Apache Spark is starting to depend on Apache ORC 1.4.0, and I have recently been trying to replace the old ORC completely without regressions.
This issue is a regression in Apache ORC 1.4.0 compared with the old ORC; see the following Spark test:

https://github.com/apache/spark/blob/master/sql/hive/src/test/scala/org/apache/spark/sql/hive/orc/OrcQuerySuite.scala#L595-L614

> Allow `orc.include.columns` to be empty
> ---------------------------------------
>
>                 Key: ORC-233
>                 URL: https://issues.apache.org/jira/browse/ORC-233
>             Project: ORC
>          Issue Type: Bug
>          Components: Java
>    Affects Versions: 1.4.0
>            Reporter: Dongjoon Hyun
>
> Apache ORC should support returning all NULLs when `orc.include.columns` is set to an empty string, as follows.
> {code}
> conf.set(OrcConf.INCLUDE_COLUMNS.getAttribute, "")
> {code}
> Currently, it raises the following exception.
> {code}
> For input string: ""
> java.lang.NumberFormatException: For input string: ""
> 	at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
> 	at java.lang.Integer.parseInt(Integer.java:592)
> 	at java.lang.Integer.parseInt(Integer.java:615)
> 	at org.apache.orc.mapred.OrcInputFormat.parseInclude(OrcInputFormat.java:69)
> {code}
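
To illustrate the failure mode in the trace above, here is a minimal, self-contained sketch. It assumes the include list is split on commas and each token is passed to Integer.parseInt, which is consistent with the stack trace but is not taken from the ORC source. The class IncludeColumnsSketch and the helper parseIncludeIds are hypothetical names used only for illustration, not the ORC API. In Java, "".split(",") yields a single empty token, so an unguarded loop would call Integer.parseInt("") and fail; one possible guard is to treat the empty string as "select no columns".

```java
import java.util.ArrayList;
import java.util.List;

class IncludeColumnsSketch {
    // Hypothetical helper, not the ORC API: parses a comma-separated list of
    // column ids. null means the option is not set; "" means no columns.
    static List<Integer> parseIncludeIds(String columnsStr) {
        if (columnsStr == null) {
            return null; // option not set: the caller would read every column
        }
        List<Integer> ids = new ArrayList<>();
        if (columnsStr.trim().isEmpty()) {
            return ids; // empty string: select no columns instead of parsing ""
        }
        for (String idString : columnsStr.split(",")) {
            ids.add(Integer.parseInt(idString.trim()));
        }
        return ids;
    }

    public static void main(String[] args) {
        // "".split(",") produces one empty token, which Integer.parseInt rejects.
        System.out.println("".split(",").length);
        System.out.println(parseIncludeIds(""));
        System.out.println(parseIncludeIds("0,2"));
    }
}
```

With a guard like this, a reader could map the empty id list to an include array in which no data column is selected, so every requested column comes back as NULL. How the real fix should distinguish "not set" from "empty" is left to the ORC maintainers.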



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)