You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2020/08/25 18:06:00 UTC

[jira] [Created] (ARROW-9854) [R] Support reading/writing data to/from S3

Neal Richardson created ARROW-9854:
--------------------------------------

             Summary: [R] Support reading/writing data to/from S3
                 Key: ARROW-9854
                 URL: https://issues.apache.org/jira/browse/ARROW-9854
             Project: Apache Arrow
          Issue Type: New Feature
          Components: R
            Reporter: Neal Richardson
            Assignee: Neal Richardson
             Fix For: 2.0.0


Current S3 support is limited to (1) being able to instantiate an S3FileSystem object, primarily from a URI, and (2) ability to open_dataset from an S3 URI. Before widely declaring that we support S3 in R, we should be able to:

* download dataset (i.e. copy files/directory recursively)
* read_parquet/feather/etc. from S3 (use FileSystem->OpenInputFile(path))
* write_$FORMAT via FileSystem->OpenOutputStream(path)
* write_dataset
* for linux, an argument to install_arrow to help, assuming you've installed aws-sdk-cpp already (turn on ARROW_S3, AWSSDK_SOURCE=SYSTEM)
* testing with minio on CI
* set up a real test bucket and user for e2e testing
* update docs and vignettes



--
This message was sent by Atlassian Jira
(v8.3.4#803005)