You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Wes McKinney (JIRA)" <ji...@apache.org> on 2017/04/03 13:23:41 UTC

[jira] [Commented] (ARROW-760) [Python] document differences w.r.t. fastparquet

    [ https://issues.apache.org/jira/browse/ARROW-760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15953463#comment-15953463 ] 

Wes McKinney commented on ARROW-760:
------------------------------------

[~jreback] I just moved this JIRA to the Arrow side -- I think the differences mostly have to do with the Python API, but we can make a list of C++-only requirements that we need from parquet-cpp (e.g. an API in parquet_arrow for reading a single row group vs. the entire file)

> [Python] document differences w.r.t. fastparquet
> ------------------------------------------------
>
>                 Key: ARROW-760
>                 URL: https://issues.apache.org/jira/browse/ARROW-760
>             Project: Apache Arrow
>          Issue Type: Improvement
>            Reporter: Jeff Reback
>            Priority: Minor
>              Labels: doc
>
> differences in options and/or actual written file formats w.r.t. https://fastparquet.readthedocs.io/en/latest/
> - null handling
> - non-supported type handling
> - options that can be passed via top-level functions



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)