You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Joris Van den Bossche (Jira)" <ji...@apache.org> on 2020/09/09 13:06:00 UTC

[jira] [Created] (ARROW-9952) [Python] Use pyarrow.dataset writing for pq.write_to_dataset

Joris Van den Bossche created ARROW-9952:
--------------------------------------------

             Summary: [Python] Use pyarrow.dataset writing for pq.write_to_dataset
                 Key: ARROW-9952
                 URL: https://issues.apache.org/jira/browse/ARROW-9952
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Python
            Reporter: Joris Van den Bossche
             Fix For: 2.0.0


Now ARROW-9658 and ARROW-9893 are in, we can explore using the {{pyarrow.dataset}} writing capabilities in {{parquet.write_to_dataset}}.

Similarly as was done in {{pq.read_table}}, we could initially have a keyword to switch between both implementations, eventually defaulting to the new datasets one, and to deprecated the old (inefficient) python implementation.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)