You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Jonathan Keane (Jira)" <ji...@apache.org> on 2021/11/17 16:59:00 UTC

[jira] [Resolved] (ARROW-14640) [R] reading data from S3

     [ https://issues.apache.org/jira/browse/ARROW-14640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Keane resolved ARROW-14640.
------------------------------------
    Fix Version/s: 7.0.0
       Resolution: Fixed

Issue resolved by pull request 11691
[https://github.com/apache/arrow/pull/11691]

> [R] reading data from S3
> ------------------------
>
>                 Key: ARROW-14640
>                 URL: https://issues.apache.org/jira/browse/ARROW-14640
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: R
>    Affects Versions: 6.0.0
>            Reporter: Mikhail Tolmachev
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 7.0.0
>
>          Time Spent: 1h
>  Remaining Estimate: 0h
>
> I am trying to read data directly from S3. In my work pipeline I work under proxy, so I set system environments HTTP_PROXY and HTTPS_PROXY. Also I have set AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY.  I have constructed links for reading:
> {code:java}
> s3_uri <- paste0("s3://", accessKeyId, ":", URLencode(secretAccessKey, reserved = T),"@", bucketName, "/", path) 
> s3_uri_short <- paste0("s3://", bucketName, "/", path)
> bucketS3 <- arrow::s3_bucket(bucket = bucketName, access_key = accessKeyId,
>                        secret_key = URLencode(secretAccessKey, reserved = T)){code}
> But none of them worked for arrow::read_parquet
> {code:java}
> df <- arrow::read_parquet(s3_uri)
> df <- arrow::read_parquet(s3_uri_short) 
> df <- arrow::read_parquet(bucketS3$path(x = path)){code}
> Error
> {code:java}
>  IOError: When reading information for key AWS Error [code 15]: No response body. with address : {code}
> How can I fix the problem or maybe get more informative output ? Thanks for attention. 
> I knew about same issue https://issues.apache.org/jira/browse/ARROW-12126 but I can't reopen it.
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)