Posted to jira@arrow.apache.org by "Joris Van den Bossche (Jira)" <ji...@apache.org> on 2021/02/27 12:59:00 UTC

[jira] [Updated] (ARROW-9682) [Python] Unable to specify the partition style with pq.write_to_dataset

     [ https://issues.apache.org/jira/browse/ARROW-9682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joris Van den Bossche updated ARROW-9682:
-----------------------------------------
    Component/s: Python

> [Python] Unable to specify the partition style with pq.write_to_dataset
> -----------------------------------------------------------------------
>
>                 Key: ARROW-9682
>                 URL: https://issues.apache.org/jira/browse/ARROW-9682
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>    Affects Versions: 1.0.0
>         Environment: Ubuntu 18.04, Python 3.7
>            Reporter: Lance Dacey
>            Priority: Major
>              Labels: dataset-parquet-write, parquet, parquetWriter
>
> I am able to import and test DirectoryPartitioning, but I cannot find a way to write a dataset that uses it. It seems that write_to_dataset always defaults to the "hive" style. Is there a way to test this?
> {code:python}
> import pyarrow as pa
> from pyarrow.dataset import DirectoryPartitioning
> # Directory-style partitioning: path segments map to the schema fields in order.
> partitioning = DirectoryPartitioning(pa.schema([("year", pa.int16()), ("month", pa.int8()), ("day", pa.int8())]))
> print(partitioning.parse("/2009/11/3"))
> {code}
>  
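For readers unfamiliar with the two styles named in the description: a "hive" layout encodes each partition field as a key=value directory segment, while a directory layout uses the bare values in schema order. The pure-Python sketch below only illustrates the resulting paths for the example values. The helper names hive_path and directory_path are hypothetical and are not part of pyarrow, and the sketch makes no pyarrow calls.

```python
from typing import Mapping


def hive_path(fields: Mapping[str, object]) -> str:
    """Build a hive-style partition path: one key=value segment per field."""
    return "/".join(f"{name}={value}" for name, value in fields.items())


def directory_path(fields: Mapping[str, object]) -> str:
    """Build a directory-style partition path: bare values in field order."""
    return "/".join(str(value) for value in fields.values())


# Example values taken from the path parsed in the issue description.
fields = {"year": 2009, "month": 11, "day": 3}

print(hive_path(fields))       # hive style
print(directory_path(fields))  # directory style
```

Field order matters for the directory style, since the names are not stored in the path and can only be recovered from the partitioning schema.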


