Posted to jira@arrow.apache.org by "Neville Dipale (Jira)" <ji...@apache.org> on 2021/01/10 06:46:00 UTC
[jira] [Updated] (ARROW-6189) [Rust] [Parquet] Plain encoded boolean column chunks limited to 2048 values
[ https://issues.apache.org/jira/browse/ARROW-6189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Neville Dipale updated ARROW-6189:
----------------------------------
Fix Version/s: 4.0.0
> [Rust] [Parquet] Plain encoded boolean column chunks limited to 2048 values
> ---------------------------------------------------------------------------
>
> Key: ARROW-6189
> URL: https://issues.apache.org/jira/browse/ARROW-6189
> Project: Apache Arrow
> Issue Type: Bug
> Components: Rust
> Affects Versions: 0.14.1
> Reporter: Simon Jones
> Priority: Major
> Fix For: 4.0.0
>
>
> encoding::PlainEncoder::new creates a BitWriter with 256 bytes (2048 bits) of storage, so at most 2048 boolean values can be written, which limits the data page size that can be used.
> I suggest that, in
> {{impl Encoder<BoolType> for PlainEncoder<BoolType>}}
> the return value of put_value be checked, and the BitWriter be flushed and cleared whenever it runs out of space.
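
The suggestion above can be sketched in self-contained Rust. This is an illustration only, not the parquet crate's code: FixedBitWriter, BoolPlainEncoder, put_bool, written, clear, put and flush_buffer are hypothetical names chosen for this sketch, and the real BitWriter and Encoder signatures may differ. It assumes the writer reports a full buffer through a boolean return value, as the issue describes for put_value.

```rust
/// Hypothetical fixed-capacity bit writer (a stand-in, not the parquet crate's BitWriter).
struct FixedBitWriter {
    buf: Vec<u8>,
    bit_len: usize,
}

impl FixedBitWriter {
    fn new(capacity_bytes: usize) -> Self {
        FixedBitWriter { buf: vec![0u8; capacity_bytes], bit_len: 0 }
    }

    /// Appends one bit (LSB first). Returns false and writes nothing if the buffer is full.
    fn put_bool(&mut self, v: bool) -> bool {
        if self.bit_len == self.buf.len() * 8 {
            return false;
        }
        if v {
            self.buf[self.bit_len / 8] |= 1 << (self.bit_len % 8);
        }
        self.bit_len += 1;
        true
    }

    /// Bytes written so far; a partially filled last byte is zero-padded.
    fn written(&self) -> &[u8] {
        &self.buf[..(self.bit_len + 7) / 8]
    }

    /// Resets the writer so the buffer can be reused.
    fn clear(&mut self) {
        self.buf.iter_mut().for_each(|b| *b = 0);
        self.bit_len = 0;
    }
}

/// Hypothetical boolean plain encoder that drains the writer whenever it fills up.
struct BoolPlainEncoder {
    writer: FixedBitWriter,
    out: Vec<u8>,
}

impl BoolPlainEncoder {
    fn new() -> Self {
        // 256 bytes mirrors the buffer size mentioned in the issue.
        BoolPlainEncoder { writer: FixedBitWriter::new(256), out: Vec::new() }
    }

    fn put(&mut self, values: &[bool]) {
        for &v in values {
            // Check the return value instead of ignoring it.
            if !self.writer.put_bool(v) {
                // The writer is full, which only happens on a byte boundary,
                // so its contents can be appended to the output as-is.
                self.out.extend_from_slice(self.writer.written());
                self.writer.clear();
                let ok = self.writer.put_bool(v);
                debug_assert!(ok, "an empty writer must accept one bit");
            }
        }
    }

    /// Returns all encoded bytes and resets the encoder.
    fn flush_buffer(&mut self) -> Vec<u8> {
        self.out.extend_from_slice(self.writer.written());
        self.writer.clear();
        std::mem::take(&mut self.out)
    }
}
```

With this shape, calling put with more than 2048 values no longer stops at the writer's capacity: each full 256-byte buffer is appended to the output before encoding continues. Growing the buffer on demand would be an alternative design; the sketch follows the flush-and-clear approach proposed in the issue.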
--
This message was sent by Atlassian Jira
(v8.3.4#803005)