You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Csaba Ringhofer (Jira)" <ji...@apache.org> on 2022/08/19 15:35:00 UTC

[jira] [Resolved] (IMPALA-9579) Read support for BINARY in ORC

     [ https://issues.apache.org/jira/browse/IMPALA-9579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Csaba Ringhofer resolved IMPALA-9579.
-------------------------------------
    Resolution: Fixed

> Read support for BINARY in ORC
> ------------------------------
>
>                 Key: IMPALA-9579
>                 URL: https://issues.apache.org/jira/browse/IMPALA-9579
>             Project: IMPALA
>          Issue Type: Sub-task
>          Components: Backend
>            Reporter: Csaba Ringhofer
>            Assignee: Csaba Ringhofer
>            Priority: Major
>              Labels: orc
>
> ORC has its own BINARY type, which has some differences compared to STRING/VARCHAR/CHAR, as BINARY:
> - doesn't use dictionary encoding
> - doesn't store min/max values in the statistics
> The c++ library uses the same ColumnReader as for STRING, so the implementation efforts should be minimal: https://github.com/apache/orc/blob/a9ec6a2e39ed71ef8a2d874df14700956aa847be/c%2B%2B/src/ColumnReader.cc#L1752



--
This message was sent by Atlassian Jira
(v8.20.10#820010)