You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Wes McKinney (JIRA)" <ji...@apache.org> on 2019/02/08 05:19:00 UTC
[jira] [Updated] (ARROW-4359) Column metadata is not saved or
loaded in parquet
[ https://issues.apache.org/jira/browse/ARROW-4359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wes McKinney updated ARROW-4359:
--------------------------------
Component/s: Python
> Column metadata is not saved or loaded in parquet
> -------------------------------------------------
>
> Key: ARROW-4359
> URL: https://issues.apache.org/jira/browse/ARROW-4359
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Reporter: Seb Fru
> Priority: Major
>
> Hi all,
> a while ago I posted this issue:
> {color:#333333}https://issues.apache.org/jira/browse/ARROW-3866{color}
> {color:#333333}While working with Pyarrow I encountered another potential bug related to column metadata: If I create a table containing columns with metadata everything is fine. But after I save the table to parquet and load it back as a table using pq.read_table, the column metadata is gone.{color}
>
> {color:#333333}As of now I can not say yet whether the metadata is not saved correctly or not loaded correctly, as I have no idea how to verify it. Unfortunately I also don't have the time try a lot, but I wanted to let you know anyway. The mentioned issue can be used as example, just add the following lines:{color}
>
> >>> pq.write_table(tab, path)
> >>> tab2 = pq.read_table(path)
> >>> tab2.column(0).field.metadata
>
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)