You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@orc.apache.org by "Lei Sun (Jira)" <ji...@apache.org> on 2021/09/16 00:42:00 UTC

[jira] [Created] (ORC-997) Share compress output buffer among TreeWriter

Lei Sun created ORC-997:
---------------------------

             Summary: Share compress output buffer among TreeWriter
                 Key: ORC-997
                 URL: https://issues.apache.org/jira/browse/ORC-997
             Project: ORC
          Issue Type: New Feature
          Components: compression, Java
            Reporter: Lei Sun


There's individual instance of `OutStream` within each TreeWriter created by `WriterContext#createStream` method. Within `OutStream`, there are totally 3 buffers: 
 * current: the regular input buffer holding uncompressed, unencrypted bytes.
 * compress: the output buffer holding compressed bytes 
 * overflow: same as "compress" but only used when the last compression output is larger than remaining capacity of compress buffer. 

Potentially the compress and overflow buffer doesn't have to be allocated individually within each OutStream object, but shared across all of them so to save memory. This ticket is trying to achieve that and obtain saving number with some sort of benchmarking(yet to be defined). 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)