You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Joris Van den Bossche (Jira)" <ji...@apache.org> on 2020/09/09 13:06:00 UTC
[jira] [Created] (ARROW-9952) [Python] Use pyarrow.dataset writing
for pq.write_to_dataset
Joris Van den Bossche created ARROW-9952:
--------------------------------------------
Summary: [Python] Use pyarrow.dataset writing for pq.write_to_dataset
Key: ARROW-9952
URL: https://issues.apache.org/jira/browse/ARROW-9952
Project: Apache Arrow
Issue Type: Improvement
Components: Python
Reporter: Joris Van den Bossche
Fix For: 2.0.0
Now ARROW-9658 and ARROW-9893 are in, we can explore using the {{pyarrow.dataset}} writing capabilities in {{parquet.write_to_dataset}}.
Similarly as was done in {{pq.read_table}}, we could initially have a keyword to switch between both implementations, eventually defaulting to the new datasets one, and to deprecated the old (inefficient) python implementation.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)