You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Vladimir (Jira)" <ji...@apache.org> on 2022/09/15 14:15:00 UTC

[jira] [Created] (ARROW-17739) [Python] write_to_parquet with s3fs will fails on no permissions for CreateBucket even when bucket exists (regression in 8.0.0)

Vladimir created ARROW-17739:
--------------------------------

             Summary: [Python] write_to_parquet with s3fs will fails on no permissions for CreateBucket even when bucket exists (regression in 8.0.0)
                 Key: ARROW-17739
                 URL: https://issues.apache.org/jira/browse/ARROW-17739
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 8.0.1, 9.0.0, 8.0.0
            Reporter: Vladimir


Hello

in version 8.0.0 and later the following bug was introduced: `write_to_dataset` will fail with the error "An error occurred (AccessDenied) when calling the CreateBucket operation: Access Denied" on writing to existing S3 bucket.
{code:java}
import s3fs
import pyarrow as pa
import pyarrow.parquet as pq

print(pa.__version__)

dff = pd.DataFrame({'Par1': ['a','b','c']*8, 'Par2': ['x','y']*12, 'Par3': range(24)})
path = 's3://bucket/path'

s3 = s3fs.S3FileSystem()
table = pa.Table.from_pandas(dff)

# This will pass
pq.write_table(table, path + '/table.parquet', filesystem=s3)

# This will fail: An error occurred (AccessDenied) when calling the CreateBucket operation: Access Denied
pq.write_to_dataset(table, path + '/table_partitioned.parquet', filesystem=s3, partition_cols=['Par1', 'Par2']) {code}
In 7.0.0 the code runs without errors.

Best regards

Vladimir



--
This message was sent by Atlassian Jira
(v8.20.10#820010)