You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Csaba Ringhofer (Jira)" <ji...@apache.org> on 2022/08/19 15:35:00 UTC
[jira] [Resolved] (IMPALA-9579) Read support for BINARY in ORC
[ https://issues.apache.org/jira/browse/IMPALA-9579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Csaba Ringhofer resolved IMPALA-9579.
-------------------------------------
Resolution: Fixed
> Read support for BINARY in ORC
> ------------------------------
>
> Key: IMPALA-9579
> URL: https://issues.apache.org/jira/browse/IMPALA-9579
> Project: IMPALA
> Issue Type: Sub-task
> Components: Backend
> Reporter: Csaba Ringhofer
> Assignee: Csaba Ringhofer
> Priority: Major
> Labels: orc
>
> ORC has its own BINARY type, which has some differences compared to STRING/VARCHAR/CHAR, as BINARY:
> - doesn't use dictionary encoding
> - doesn't store min/max values in the statistics
> The c++ library uses the same ColumnReader as for STRING, so the implementation efforts should be minimal: https://github.com/apache/orc/blob/a9ec6a2e39ed71ef8a2d874df14700956aa847be/c%2B%2B/src/ColumnReader.cc#L1752
--
This message was sent by Atlassian Jira
(v8.20.10#820010)