You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@orc.apache.org by "Riza Suminto (Jira)" <ji...@apache.org> on 2022/01/03 18:04:00 UTC

[jira] [Commented] (ORC-450) [C++] Support selecting list indices without materializing list items

    [ https://issues.apache.org/jira/browse/ORC-450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17468133#comment-17468133 ] 

Riza Suminto commented on ORC-450:
----------------------------------

Hello! I'd like to work on this issue.

I have an idea to introduce fourth interface in RowReaderOptions to select column from ORC file that is more selective than what we already have.

> [C++] Support selecting list indices without materializing list items
> ---------------------------------------------------------------------
>
>                 Key: ORC-450
>                 URL: https://issues.apache.org/jira/browse/ORC-450
>             Project: ORC
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Quanlong Huang
>            Priority: Major
>
> As we don't have a type id for index field of list type, if we just want to get the list indices, we have to select the type id of the whole array, which cause unneccessary materializaion on the array item.
> The offset stream is stored sperately with the content stream. We are able to just materialize the list indices.
> We may need new interfaces in RowReaderOptions to specifiy this.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)