Posted to dev@parquet.apache.org by "Julien Le Dem (JIRA)" <ji...@apache.org> on 2017/05/12 22:11:04 UTC

[jira] [Resolved] (PARQUET-852) Slowly ramp up sizes of byte[] in ByteBasedBitPackingEncoder

     [ https://issues.apache.org/jira/browse/PARQUET-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julien Le Dem resolved PARQUET-852.
-----------------------------------
       Resolution: Fixed
    Fix Version/s: 1.10.0

Issue resolved by pull request 401
[https://github.com/apache/parquet-mr/pull/401]

> Slowly ramp up sizes of byte[] in ByteBasedBitPackingEncoder
> ------------------------------------------------------------
>
>                 Key: PARQUET-852
>                 URL: https://issues.apache.org/jira/browse/PARQUET-852
>             Project: Parquet
>          Issue Type: Improvement
>            Reporter: John Jenkins
>            Priority: Minor
>             Fix For: 1.10.0
>
>
> The current allocation policy for ByteBasedBitPackingEncoder is to allocate 64KB * #bits up-front. As similarly observed in [PARQUET-580], this can lead to significant memory overheads for high-fanout scenarios (many columns and/or open files, in my case using BooleanPlainValuesWriter).
> As done in [PARQUET-585], I'll follow up with a PR that starts with a smaller buffer and works its way up to a max.
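
For illustration only, here is a minimal Java sketch of the allocation policy the description proposes: start with a small buffer and grow toward a cap rather than allocating the full size up-front. The class name RampingSlabBuffer, its methods, and the doubling growth factor are all assumptions made for this example. This message does not describe how pull request 401 actually implements the change.

```java
import java.util.ArrayList;
import java.util.List;

/**
 * Hypothetical sketch, not the parquet-mr implementation.
 * Bytes are stored in a list of slabs. The first slab is small, and each
 * new slab is twice the size of the previous one until a maximum is reached.
 */
class RampingSlabBuffer {
    private final int maxSlabSize;
    private final List<byte[]> slabs = new ArrayList<>();
    private byte[] current;     // slab currently being filled, null before the first write
    private int position;       // next write index within the current slab
    private int nextSlabSize;   // size to use for the next slab allocation
    private long size;          // total number of bytes written

    RampingSlabBuffer(int initialSlabSize, int maxSlabSize) {
        if (initialSlabSize <= 0 || maxSlabSize < initialSlabSize) {
            throw new IllegalArgumentException("need 0 < initialSlabSize <= maxSlabSize");
        }
        this.nextSlabSize = initialSlabSize;
        this.maxSlabSize = maxSlabSize;
    }

    void write(byte b) {
        // Allocate lazily: nothing is reserved until the first byte arrives.
        if (current == null || position == current.length) {
            addSlab();
        }
        current[position++] = b;
        size++;
    }

    private void addSlab() {
        current = new byte[nextSlabSize];
        slabs.add(current);
        position = 0;
        // Double the next slab size, capped at the maximum (long math avoids int overflow).
        nextSlabSize = (int) Math.min((long) nextSlabSize * 2, (long) maxSlabSize);
    }

    /** Total bytes written so far. */
    long size() {
        return size;
    }

    /** Total bytes reserved across all slabs, including unused capacity. */
    long allocatedBytes() {
        long total = 0;
        for (byte[] slab : slabs) {
            total += slab.length;
        }
        return total;
    }

    int slabCount() {
        return slabs.size();
    }
}
```

With an initial slab of 8 bytes and a maximum of 64, writing a single byte reserves only 8 bytes, and writing 100 bytes reserves slabs of 8, 16, 32, and 64 bytes (120 bytes in total). In a high-fanout scenario with many mostly idle writers, the unused capacity per writer stays close to the amount of data actually written.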


