You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Nicola Crane (Jira)" <ji...@apache.org> on 2022/04/07 21:03:00 UTC
[jira] [Updated] (ARROW-16144) [R] Write compressed data streams (particularly over S3)
[ https://issues.apache.org/jira/browse/ARROW-16144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nicola Crane updated ARROW-16144:
---------------------------------
Summary: [R] Write compressed data streams (particularly over S3) (was: Write compressed data streams (particularly over S3))
> [R] Write compressed data streams (particularly over S3)
> --------------------------------------------------------
>
> Key: ARROW-16144
> URL: https://issues.apache.org/jira/browse/ARROW-16144
> Project: Apache Arrow
> Issue Type: Improvement
> Components: R
> Affects Versions: 7.0.0
> Reporter: Carl Boettiger
> Priority: Major
>
> The python bindings have `CompressedOutputStream`, but I don't see how we can do this on the R side (e.g. with `write_csv_arrow()`). It would be wonderful if we could both read and write compressed streams, particularly for CSV and particularly for remote filesystems, where this can provide considerable performance improvements.
> (For comparison, readr will write a compressed stream automatically based on the extension for the given filename, e.g. `readr::write_csv(data, "file.csv.gz")` or `write_csv("data.file.xz")` )
--
This message was sent by Atlassian Jira
(v8.20.1#820001)