You are viewing a plain text version of this content. The canonical link for it is here.

Posted to jira@arrow.apache.org by "Brian Kiefer (Jira)" <ji...@apache.org> on 2021/11/04 16:57:00 UTC

[jira] [Comment Edited] (ARROW-13781) [Python] Allow per column encoding in parquet writer

    [ https://issues.apache.org/jira/browse/ARROW-13781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17438836#comment-17438836 ] 

Brian Kiefer edited comment on ARROW-13781 at 11/4/21, 4:56 PM:
----------------------------------------------------------------

[~alenkaf] - I have a first pass at the code changes available. It compiled but crashed at runtime, then other priorities pulled me away. I'll throw it up on Github tonight and link to it here


was (Author: bkief):
[~alenkaf] - I have a first pass at the code changes available. It compiled but crashed at runtime, then other priorities pulled me away. I'll through it up on Github tonight and link to it here

> [Python] Allow per column encoding in parquet writer 
> -----------------------------------------------------
>
>                 Key: ARROW-13781
>                 URL: https://issues.apache.org/jira/browse/ARROW-13781
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Parquet, Python
>            Reporter: Brian Kiefer
>            Assignee: Alenka Frim
>            Priority: Minor
>
> Add a new parameter to `write_table` to allow parquet encodings to be defined on a per column basis. This should supercede use_dictionary and use_byte_stream_split.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)