You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@orc.apache.org by "Gang Wu (Jira)" <ji...@apache.org> on 2023/02/03 07:02:00 UTC

[jira] [Resolved] (ORC-1280) [C++] Implement block-based buffer to replace DataBuffer in the ColumnWriter

     [ https://issues.apache.org/jira/browse/ORC-1280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gang Wu resolved ORC-1280.
--------------------------
    Resolution: Fixed

> [C++] Implement block-based buffer to replace DataBuffer in the ColumnWriter
> ----------------------------------------------------------------------------
>
>                 Key: ORC-1280
>                 URL: https://issues.apache.org/jira/browse/ORC-1280
>             Project: ORC
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Gang Wu
>            Assignee: ZhangXin
>            Priority: Major
>
> The problem comes from a community discussion: [Huge memory taken for each field when exporting · Issue #1240 · apache/orc (github.com)|https://github.com/apache/orc/issues/1240]. The basic idea is to implement a block-based buffer management and replace the DataBuffer within the ColumnWriter to automatically adapt its size to accommodate data before compression.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)