You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Wes McKinney (JIRA)" <ji...@apache.org> on 2019/02/08 05:19:00 UTC

[jira] [Updated] (ARROW-4359) [Python][Parquet] Column metadata is not saved or loaded in parquet

     [ https://issues.apache.org/jira/browse/ARROW-4359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney updated ARROW-4359:
--------------------------------
    Summary: [Python][Parquet] Column metadata is not saved or loaded in parquet  (was: Column metadata is not saved or loaded in parquet)

> [Python][Parquet] Column metadata is not saved or loaded in parquet
> -------------------------------------------------------------------
>
>                 Key: ARROW-4359
>                 URL: https://issues.apache.org/jira/browse/ARROW-4359
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>            Reporter: Seb Fru
>            Priority: Major
>              Labels: parquet
>
> Hi all,
> a while ago I posted this issue:
> {color:#333333}https://issues.apache.org/jira/browse/ARROW-3866{color}
> {color:#333333}While working with Pyarrow I encountered another potential bug related to column metadata: If I create a table containing columns with metadata everything is fine. But after I save the table to parquet and load it back as a table using pq.read_table, the column metadata is gone.{color}
>  
> {color:#333333}As of now I can not say yet whether the metadata is not saved correctly or not loaded correctly, as I have no idea how to verify it. Unfortunately I also don't have the time try a lot, but I wanted to let you know anyway. The mentioned issue can be used as example, just add the following lines:{color}
>  
> >>> pq.write_table(tab, path)
> >>> tab2 = pq.read_table(path)
> >>> tab2.column(0).field.metadata
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)