You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Neville Dipale (Jira)" <ji...@apache.org> on 2021/01/10 06:46:00 UTC

[jira] [Updated] (ARROW-6189) [Rust] [Parquet] Plain encoded boolean column chunks limited to 2048 values

     [ https://issues.apache.org/jira/browse/ARROW-6189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Neville Dipale updated ARROW-6189:
----------------------------------
    Fix Version/s: 4.0.0

> [Rust] [Parquet] Plain encoded boolean column chunks limited to 2048 values
> ---------------------------------------------------------------------------
>
>                 Key: ARROW-6189
>                 URL: https://issues.apache.org/jira/browse/ARROW-6189
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Rust
>    Affects Versions: 0.14.1
>            Reporter: Simon Jones
>            Priority: Major
>             Fix For: 4.0.0
>
>
> encoding::PlainEncoder::new creates a BitWriter with 256 bytes of storage, which limits the data page size that can be used. 
> I suggest that in
> {{impl Encoder<BoolType> for PlainEncoder<BoolType>}}
> the return value of put_value is tested and the BitWriter flushed+cleared whenever it runs out of space.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)