Posted to jira@arrow.apache.org by "SHIMA Tatsuya (Jira)" <ji...@apache.org> on 2022/07/15 16:53:00 UTC

[jira] [Updated] (ARROW-17089) [Python] Use `.arrow` as extension for IPC file dataset

     [ https://issues.apache.org/jira/browse/ARROW-17089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

SHIMA Tatsuya updated ARROW-17089:
----------------------------------
    Description: 
Same as ARROW-17088

As noted in the following document, the recommended extension for IPC files is now `.arrow`.

> We recommend the “.arrow” extension for files created with this format.
https://arrow.apache.org/docs/format/Columnar.html#ipc-file-format

However, when writing a dataset with the {{pyarrow.dataset.write_dataset}} function, the default file extension is currently {{.feather}} whenever the {{arrow}}, {{ipc}}, or {{feather}} format is selected.
https://github.com/apache/arrow/blob/b8067151db9bfc04860285fdd8b5e73703346037/python/pyarrow/_dataset.pyx#L1149-L1151


> [Python] Use `.arrow` as extension for IPC file dataset
> -------------------------------------------------------
>
>                 Key: ARROW-17089
>                 URL: https://issues.apache.org/jira/browse/ARROW-17089
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>    Affects Versions: 8.0.0
>            Reporter: SHIMA Tatsuya
>            Priority: Major
>
> Same as ARROW-17088
> As noted in the following document, the recommended extension for IPC files is now `.arrow`.
> > We recommend the “.arrow” extension for files created with this format.
> https://arrow.apache.org/docs/format/Columnar.html#ipc-file-format
> However, when writing a dataset with the {{pyarrow.dataset.write_dataset}} function, the default file extension is currently {{.feather}} whenever the {{arrow}}, {{ipc}}, or {{feather}} format is selected.
> https://github.com/apache/arrow/blob/b8067151db9bfc04860285fdd8b5e73703346037/python/pyarrow/_dataset.pyx#L1149-L1151



--
This message was sent by Atlassian Jira
(v8.20.10#820010)