Posted to jira@arrow.apache.org by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/10/18 20:28:00 UTC

[jira] [Resolved] (ARROW-17524) The ORC reader method ReadStripe does not work when we specify fields to selected as a list of integers

     [ https://issues.apache.org/jira/browse/ARROW-17524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Antoine Pitrou resolved ARROW-17524.
------------------------------------
    Fix Version/s: 11.0.0
                       (was: 10.0.0)
       Resolution: Fixed

Issue resolved by pull request 13962
[https://github.com/apache/arrow/pull/13962]

> The ORC reader method ReadStripe does not work when we specify fields to selected as a list of integers
> -------------------------------------------------------------------------------------------------------
>
>                 Key: ARROW-17524
>                 URL: https://issues.apache.org/jira/browse/ARROW-17524
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++
>    Affects Versions: 8.0.1
>            Reporter: Louis Calot
>            Assignee: Louis Calot
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 11.0.0
>
>          Time Spent: 2h
>  Remaining Estimate: 0h
>
> I think there is a bug in the ORC reader: when we specify the indexes of the fields that we want to keep, the selection does not work correctly. Looking at the code, this seems to be because we call "includeTypes" instead of "include" when setting the ORC options.
> This is a problem when we want to import an ORC table containing Union types, because the import raises an error even if we try not to import those specific fields.
> The corresponding ORC methods are defined here:
> [https://github.com/apache/orc/blob/72220851cbde164a22706f8d47741fd1ad3db190/c%2B%2B/src/Options.hh#L185-L191]
> and
> [https://github.com/apache/orc/blob/72220851cbde164a22706f8d47741fd1ad3db190/c%2B%2B/src/Options.hh#L201-L207]
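
To illustrate why the two calls can select different columns, here is a minimal sketch. It does not use the ORC or Arrow libraries: ToyType, flatten, select_by_field_index, select_by_type_id and example_schema are hypothetical names made up for this example. It assumes that "include" interprets each number as the position of a top-level field (0 = first column), while "includeTypes" interprets it as a type id assigned by a pre-order walk of the type tree with the root struct as id 0. The real readers also pull in the parents and children of a selected type, which this toy model leaves out.

```cpp
#include <cassert>
#include <cstdint>
#include <list>
#include <string>
#include <vector>

// Toy stand-in for an ORC type tree. This is NOT the orc::Type API.
struct ToyType {
  std::string name;
  std::vector<ToyType> children;
};

// Collect names in pre-order, so that a node's position in `out`
// plays the role of its type id (root = 0).
void flatten(const ToyType& t, std::vector<std::string>& out) {
  out.push_back(t.name);
  for (const auto& child : t.children) flatten(child, out);
}

// Assumed semantics of "include": numbers are positions among the
// top-level fields of the root struct.
std::vector<std::string> select_by_field_index(
    const ToyType& root, const std::list<uint64_t>& indexes) {
  std::vector<std::string> out;
  for (uint64_t i : indexes) {
    assert(i < root.children.size());
    out.push_back(root.children[i].name);
  }
  return out;
}

// Assumed semantics of "includeTypes": numbers are pre-order type ids.
std::vector<std::string> select_by_type_id(
    const ToyType& root, const std::list<uint64_t>& type_ids) {
  std::vector<std::string> flat;
  flatten(root, flat);
  std::vector<std::string> out;
  for (uint64_t id : type_ids) {
    assert(id < flat.size());
    out.push_back(flat[id]);
  }
  return out;
}

// Hypothetical schema: struct<a:int, b:struct<x:int, y:int>, c:string>
ToyType example_schema() {
  return ToyType{"root",
                 {ToyType{"a", {}},
                  ToyType{"b", {ToyType{"x", {}}, ToyType{"y", {}}}},
                  ToyType{"c", {}}}};
}
```

With this schema, the list {1, 2} picks fields b and c when read as field indexes, but a and b when read as type ids. The two interpretations drift apart as soon as a nested type appears before the requested column, which would match the behaviour described in the report.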



--
This message was sent by Atlassian Jira
(v8.20.10#820010)