You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "John Jenkins (JIRA)" <ji...@apache.org> on 2017/02/01 14:02:51 UTC

[jira] [Created] (PARQUET-852) Slowly ramp up sizes of byte[] in ByteBasedBitPackingEncoder

John Jenkins created PARQUET-852:
------------------------------------

             Summary: Slowly ramp up sizes of byte[] in ByteBasedBitPackingEncoder
                 Key: PARQUET-852
                 URL: https://issues.apache.org/jira/browse/PARQUET-852
             Project: Parquet
          Issue Type: Improvement
            Reporter: John Jenkins
            Priority: Minor


The current allocation policy for ByteBasedBitPackingEncoder is to allocate 64KB * #bits up-front. As similarly observed in [PARQUET-580], this can lead to significant memory overheads for high-fanout scenarios (many columns and/or open files, in my case using BooleanPlainValuesWriter).

As done in [PARQUET-585], I'll follow up with a PR that starts with a smaller buffer and works its way up to a max.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)