You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "SHIMA Tatsuya (Jira)" <ji...@apache.org> on 2022/07/15 16:53:00 UTC
[jira] [Updated] (ARROW-17089) [Python] Use `.arrow` as extension for IPC file dataset
[ https://issues.apache.org/jira/browse/ARROW-17089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
SHIMA Tatsuya updated ARROW-17089:
----------------------------------
Description:
Same as ARROW-17088
As noted in the following document, the recommended extension for IPC files is now `.arrow`.
> We recommend the “.arrow” extension for files created with this format.
https://arrow.apache.org/docs/format/Columnar.html#ipc-file-format
However, currently when writing a dataset with the {{pyarrow.dataset.write_dataset}} function, the default extension is {{.feather}} when {{arrow}} or {{ipc}} or {{feather}} is selected.
https://github.com/apache/arrow/blob/b8067151db9bfc04860285fdd8b5e73703346037/python/pyarrow/_dataset.pyx#L1149-L1151
was:
Same as ARROW-17088
As noted in the following document, the recommended extension for IPC files is now `.arrow`.
> We recommend the “.arrow” extension for files created with this format.
https://arrow.apache.org/docs/format/Columnar.html#ipc-file-format
However, currently when writing a dataset with the {{pyarrow.dataset.write_dataset}} function, the default extension is {{.feather}} when {{arrow}} or {{ipc}} or {{feather}} is selected.
https://github.com/apache/arrow/blob/b8067151db9bfc04860285fdd8b5e73703346037/python/pyarrow/_dataset.pyx#L1149-L1151
> [Python] Use `.arrow` as extension for IPC file dataset
> -------------------------------------------------------
>
> Key: ARROW-17089
> URL: https://issues.apache.org/jira/browse/ARROW-17089
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Python
> Affects Versions: 8.0.0
> Reporter: SHIMA Tatsuya
> Priority: Major
>
> Same as ARROW-17088
> As noted in the following document, the recommended extension for IPC files is now `.arrow`.
> > We recommend the “.arrow” extension for files created with this format.
> https://arrow.apache.org/docs/format/Columnar.html#ipc-file-format
> However, currently when writing a dataset with the {{pyarrow.dataset.write_dataset}} function, the default extension is {{.feather}} when {{arrow}} or {{ipc}} or {{feather}} is selected.
> https://github.com/apache/arrow/blob/b8067151db9bfc04860285fdd8b5e73703346037/python/pyarrow/_dataset.pyx#L1149-L1151
--
This message was sent by Atlassian Jira
(v8.20.10#820010)