You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2021/01/13 19:44:00 UTC
[jira] [Updated] (ARROW-3585) [Python] Update the documentation
about Schema & Metadata usage
[ https://issues.apache.org/jira/browse/ARROW-3585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Neal Richardson updated ARROW-3585:
-----------------------------------
Fix Version/s: (was: 3.0.0)
4.0.0
> [Python] Update the documentation about Schema & Metadata usage
> ---------------------------------------------------------------
>
> Key: ARROW-3585
> URL: https://issues.apache.org/jira/browse/ARROW-3585
> Project: Apache Arrow
> Issue Type: Task
> Components: Documentation
> Reporter: Daniel Haviv
> Assignee: Daniel Haviv
> Priority: Trivial
> Labels: beginner, documentation, easyfix, newbie, parquet
> Fix For: 4.0.0
>
> Original Estimate: 24h
> Remaining Estimate: 24h
>
> Reusing the Schema object from a Parquet file written with Spark with Pandas fails due to Schema mismatch.
> The culprit is in the metadata part of the schema which each component fills according to it's implementation. More details can be found here: [https://github.com/apache/arrow/issues/2805]
> The documentation should point that out.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)